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CHLAMYDIA TRACHOMATIS GENOMIC SEQUENCE AND POLYPEPTIDES, 

FRAGMENTS THEREOF AND USES THEREOF, IN PARTICULAR 
FOR THE DIAGNOSIS, PREVENTION AND TREATMENT OF INFECTION 

5 

The subject of the invention is the genomic sequence and the nucleotide sequences 
encoding polypeptides of Chlamydia trachomatis, such as cellular envelope polypeptides, which are 
secreted or specific, or which are involved in metabolism, in the replication process or in virulence, 
polypeptides encoded by such sequences as well as vectors including the said sequences and cells or 

10 animals transformed with these vectors. The invention also relates to transcriptional gene products of 
the Chlamydia trachomatis genome, such as, for example, antisense and ribozyme molecules, which 
can be used to control growth of the microorganism. The invention also relates to methods of 
detecting these nucleic acids or polypeptides and kits for diagnosing Chlamydia trachomatis 
infection. The invention also relates to a method of selecting compounds capable of modulating 

15 bacterial infection and a method for the biosynthesis or biodegradation of molecules of interest using 
the said nucleotide sequences or the said polypeptides. The invention finally comprises, 
pharmaceutical, in particular vaccine, compositions for the prevention and/or treatment of bacterial, in 
particular Chlamydia trachomatis, infections. 

The genus Chlamydia is composed of four species: Chlamydia psittaci, Chlamydia 

20 pecorum, Chlamydia pneumoniae and Chlamydia trachomatis. 

Chlamydia psittaci comprises numerous species, whose hosts are terrestrial vertebrate animals as well 

as birds and occasionally humans; 

Chlamydia pecorum is a pathogen of ruminants; 

Chlamydia pneumoniae is responsible for pneumopathies, for sinusitis and for arterial impairments in 
25 humans; 

Chlamydia trachomatis (Ct) is responsible for a large number of human diseases: 

eye diseases: conventional trachoma, nonendemic trachoma, paratrachoma, inclusion 
conjunctivitis in neonates and in adults; 

genital diseases: nongonococcal uretritis, epididymitis, cervicitis, salpingitis, perihepatitis and 
30 bartholinitis as well as pneumopathy in breast-feeding infants; 

systemic diseases: venereal lymphogranulomatosis (LGV). 
These diseases affect a very large number of women and men [more than 600 million individuals are 
trachoma carriers and there are more than 90 million cases of genital Chlamydia infections] 
worldwide. Accordingly, basic and applied research which makes it possible to understand the 
35 physiopathology linked to this bacterium is very important for public health. (Raulston JE., 1995; 
Hackstadt T. et al., 1996). 

Eye impairments due to Chlamydia trachomatis cause trachoma and inclusion 
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conjunctivitis. Trachoma is a chronic conjunctivitis. It is the major cause of curable eye diseases 
leading to blindness. It is estimated that 20 million cases of loss of sight are due to it worldwide. 
Moreover, inclusion conjunctivitis is an eye inflammation which is caused by Chlamydia trachomatis 
and is transmitted by the venereal route. Inclusion conjunctivitis affects adults and neonates exposed 
5 to genital secretions. 

Two types of eye disease caused by agents of the species Chlamydia trachomatis can 
be distinguished. The conventional trachomatous disease is found in endemic regions; transmission 
occurs from eye to eye and through the hands, or it can be passed on by flies. In nonendemic regions, 
transmission occurs through the genital apparatus; it usually only causes conjunctivitis, most often 

10 without associated keratitis; it is rare for a pannus or for scars similar to those in trachoma to develop. 
This conjunctival impairment is called paratrachoma to differentiate it from the conventional endemic 
trachoma which is transmitted by the ocular route. The seriousness and the number of cases of 
trachoma have decreased over the last forty years. This is related to the improvement in hygiene and 
living conditions. However, trachoma remains the principal cause of avoidable blindness in Africa, in 

15 the Middle East and in some regions of Asia. The transmission of the endemic disease occurs in 
particular through close personal contact, in regions where a secondary exposure exists in a repeated 
form. Often, the infection is also latent. In some industrialized countries, such as the United States, a 
mild form of trachoma still exists in some ethnic groups. Sometimes, a tardive trachoma may be found 
following an immunosuppressive treatment. The eye impairments caused by Chlamydia trachomatis, 

20 such as inclusion conjunctivitis and paratrachoma, are also a complication due to a common venereal 
infection. These infections are not very frequent; they occur most often in young adults. The eye 
impairments in neonates are produced during the passage through the maternal genital routes during 
childbirth. Theoretically, endemic trachoma and inclusion conjunctivitis in adults appear in the form 
of conjunctivitis, the latter being characterized by the presence of lymphoid follicles. In regions where 

25 the endemic disease is serious, the disease often starts before the age of 2 years and reinfection is 
frequent. Superficial neovascularization is added, in this case, to leukocytic infiltration. The 
conjunctival scars will then cause trichiasis and entropion. The eroded cornea will become a carrier of 
a corneal ulcer of bacterial origin. The scar on the cornea causes blindness. Impairment of the 
lachrymal glands gives a picture of dryness of the cornea. Xerosis becomes complicated with 

30 secondary bacterial ulcer. In regions where trachoma is endemic, the infectious process disappears 
towards the age of fifteen. The scars then progress to blindness, which affects almost exclusively 
adults. In regions where exposure is lower, the infectious process is, in this case, less rapid and adults 
are carriers of a chronic disease. 

Positive diagnosis of trachoma can be most often established by clinical observation: 

35 lymphoid follicles are visible on the upper tarsal conjuctiva; conjunctival scar is typical. Vascular 
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pannus exists. In endemic regions, clinical diagnosis is often sufficient. However, isolated cases of 
inclusion conjunctivitis must be the subject of a differential diagnosis, in particular to distinguish 
viral conjunctivitis. 

Public health measures against the endemic form of the disease provide for mass 
5 treatments with tetracycline or erythromycin collyria of all children. The treatment may also provide 
for surgical correction of the lesions. The other conjunctival impairments respond well to general 
treatments with tetracyclines or erythromycin. The prevention of trachomatous disease by health 
measures and by improving living standards is sufficient. Furthermore, to avoid the spread of 
trachoma, antibiotic collyria may be used. 

10 The role of Chlamydia trachomatis in a number of genital impairments has been 

demonstrated over the last three decades. Chlamydia trachomatis is responsible in this case for a 
pathology which may be superposed on the impairments observed with Neisseria gonorrhoeae. The 
pathologies for which Chlamydia trachomatis may be responsible at the genital level are acquired by 
the venereal route and are a major source of sexually transmitted diseases. 

15 Tn e epidemiology of Chlamydia trachomatis genital infections shows each year more 

than 4 million new cases in the United States, and more than 3 million new cases in Europe. Like the 
other venereal infections, Chlamydia trachomatis affects young subjects. There is a direct relationship 
between the number of sexual partners and the frequency of the disease. For example, the frequency 
of Chlamydia trachomatis appears to be five to ten times higher than that of Neisseria gonorrhoeae in 

20 pregnant women. The Chlamydia trachomatis infection is probably more discreet than its Neisseria 
gonorrhoeae homologue. This relative clinical silence, estimated in women at 50% or even 70% of 
infections, explains why the total morbidity of Chlamydia trachomatis conditions is high. Diagnosis 
must therefore be requested in patients who are sometimes asymptomatic carriers of infection. 

Chlamydia trachomatis is responsible for nearly 30% of nongonococcal urethritis, or 

25 NGU. Chlamydia trachomatis urethritis may be discreet, the disease then progresses to a certain form 
of chronicity. The diagnosis will, like for the other clinical forms of the disease, be called into play 
later. 

Chlamydia trachomatis is a cause of epididymitis in humans during a period of sexual 
activity. The bacterium may be found in the urethra, urine, sperm or even a sample collected by 
30 aspiration from the epididymis. It is in particular found in humans under 35 years of age. A discharge 
from the urethra which is associated with the disease suggests the diagnosis of a Chlamydia condition 
or sometimes a gonococcal condition. 

Untreated Reiter's syndrome, if accompanied by urethritis, evokes a Chlamydia 
trachomatis condition. 

35 Chlamydia trachomatis affects 30% to 40% of women who are clinically carriers of a 
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gonorrhoea (or have had contact), 10% to 20% of women having a venereal origin, 5% of women 
consulting having no particular origin. 

The cervix is often normal during a Chlamydia trachomatis infection/However, a 
hypertrophic cervical erythema will cause such an infection to be suspected. Chlamydia trachomatis 
5 is responsible for an endocervicitis whereas viral impairments result in exocervicitis. A 
nongonococcal endocervicitis requires treating the patient and partners with tetracyclines. 

Chlamydia trachomatis is responsible for a large number of acute salpingites. The 
picture is often complicated by an acute peritonitis or even a perihepatitis. 

In case of pregnancy, the risk is first that of infection of the neonate at birth. 

10 However, the risk of postpartum complications exists (endometritis or salpingitis). 

The reference method for the diagnosis of Chlamydia trachomatis is the isolation of 
the bacterium on cell culture. For all infections, the sample collection should make it possible to 
obtain a suitable sample with the aid of a swab. This sample should be transported to a laboratory 
under excellent conditions; in particular, the cold chain must absolutely be maintained. The placing in 

15 cell culture on mouse fibroblasts will be carried out by people having specific skills. The distinction 
of Chlamydia trachomatis with labelled antibodies and the observation of cell cultures under a 
microscope will take place two days after placing in culture. Provided these imperatives are observed, 
cell culture is a reliable technique. However, the constraints linked to this technique are many: not 
only must the laboratory be equipped for the cell culture, but, furthermore, highly competent staff 

20 must take care of this type of diagnosis. 

Techniques for identifying genetic material can obviously be used for the detection of 
Chlamydia trachomatis. Among these techniques, enzymatic gene amplification or PCR is favoured 
by those skilled in the art. The technique indeed makes it possible to identify Chlamydia trachomatis 
with a very high sensitivity and complete specificity. Initially used in specialist laboratories, PCR is 

25 now performed in numerous medical laboratories. This diagnostic approach is important because it 
allows detection of the bacteria even in samples which have been transported under poor conditions. 

The treatment of Chlamydia urethritis with antibiotics such as tetracycline or 
quinolones is very effective. The duration of treatment varies between 7 and 14 days. The treatment of 
pregnant women poses the problem of contraindications to tetracycline. 

30 Neonatal infections caused by Chlamydia trachomatis are explained by the frequency 

of these bacteria in the cervix. In some studies, 5% to 13% of impairments are observed in the cervix 
in asymptomatic pregnant women. The neonates risk, in this case, developing an inclusion 
conjunctivitis. Not only can Chlamydia trachomatis be isolated from the children ' s eyes, but also 
persistently from the rhinopharynx and also from the rectum. Pneumopathies and otitis media are also 

35 found, a result of contamination at childbirth. 



5 

Differential diagnosis of inclusion conjunctivitis in neonates is required with 
gonococcal ophthalmia; while the duration of incubation is from one to three days in the case of a 
gonococcal ophthalmia, neonatal inclusion conjunctivitis has an acute beginning with discharge and 
formation of membranes or even of conjunctival scars. 
5 Treatment consists of oral erythromycin at the dose of 40 to 50 mg per kg of weight, 

for two to three weeks. In a nonendemic trachoma region, this disease never progresses to chronicity. 

Finally, mention should be made of infantile pneumopathy. The syndrome is well 
defined; it is found in children affected by Chlamydia trachomatis. Less than ten children are affected 
by Chlamydia trachomatis pneumopathies per thousand births. The syndrome is, in this case, always 

10 found at an early age (less than four months). 

Venereal lymphogranulomatosis is an infection which is transmitted through sexual 
contact and is due to Chlamydia trachomatis strains LI, L2 and L3. In humans, a passing primary 
genital lesion is followed by an often suppurative and multiple regional lymphadenopathy. This 
disease is a general disease which is accompanied by fever and a rise in the number of white blood 

15 cells. If it progresses to chronicity, the disease then becomes complicated with genital elephantiasis, 
stricture or even fistula of the genital apparatus, of the penis, of the urethra and of the rectum. 

The three Chlamydia trachomatis strains LI, L2 and L3 are responsible for venereal 
lymphogranulomatosis. These Chlamydia strains are more virulent than the strains responsible for 
trachoma and STD. It is very important to note that venereal lymphogranulomatosis is a systemic 

20 disease which affects primarily the lymphatic tissue. Generally transmitted by the sexual route, 
Chlamydia trachomatis L may also cause contamination through direct contact or even during poor 
laboratory handling. In spite of these variable modes of transmission, the age for the highest incidence 
of these diseases corresponds to that for greater sexual activity. Venereal lymphogranulomatosis is 
still endemic in South America, in Africa and sometimes in Asia. For a long time, the prevalence of 

25 venereal lymphogranulomatosis was difficult to establish because of the difficulty of performing 
diagnosis with certitude. It should also be noted that men are affected more often than women. In low 
endemic regions, it is difficult to recognize the reservoir of microbes. This situation is explained by 
the fact that the isolation of the strains causing venereal lymphogranulomatoses from asymptomatic 
subjects is rarely successful. 

30 Clinical impairment by venereal lymphogranulomatosis manifests itself by the 

appearance of a small ulcer 3 to 21 days after the exposure of small nonpainful vesicles. In both men 
and women, the lesion is most often silent. Since this impairment disappears within a few days and 
causes no functional discomfort and leaves no visible scar, the disease is often recognized late. The 
venereal lymphogranulomatosis strains may be found in the urethra or the endocervix in patients with 

35 inguinal adenopathies; these regions are then considered as the initial site of infection. The 
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characteristic feature of the venereal lymphogranulomatosis strains is that from the initial site of 
infection, Chlamydia exhibits a diffusion drained by the lymphatic ducts. The disease is then 
complicated by a ganglionic impairment of the region draining the site of inoculation. By way of 
example, anorectal infection causes deep adenopathies. These adneopathies are marked by the 
5 appearance of a periadenitis which forms a fluctuating and suppurative ganglionic mass. Fistulae will 
appear during the decline of the disease. As general signs are present at this stage of the disease, it is 
often confused with a malignant lymphoma. The other general complications are rarely observed. 
Clinical examinations have been able to lead biologists to isolate Chlamydia from the cerebrospinal 
fluid or from the blood. It should also be noted that in a number of cases (5%), venereal 

10 lymphogranulomatosis is complicated by a chronic oedema: this is genital elephantiasis. 

The diagnosis of venereal lymphogranulomatosis requires the isolation of the 
Chlamydia strains involved in the disease. However, isolation on cell cultures is rarely used, but 
immunological reactions may be used. 

The treatment of venereal lymphogranulomatosis in its initial phase is identical to the 

15 treatment of other Chlamydia infections. In the chronic phases, antibiotics have little effect on the 
progress of the disease, but they are however useful in case of superinfection. Although the 
recommended therapeutic arsenal is identical, it is advisable to prolong the treatment for a period of at 
least four weeks. In addition to this treatment, reconstructive surgery may be useful in cases of 
urethra], penile or rectal strictures, as well as for the treatment of fistulae. 

20 In conclusion, a short and effective treatment, without recurrences, and a well- 

tolerated treatment of Chlamydia trachomatis infections therefore remains desirable. 

An even greater need up until now relates to a diagnosis which is specific to each of 
the strains, which is sensitive, which can be carried out conveniently and rapidly, and which allows 
early detection of the infection. 

25 No vaccine is currently available against Chlamydia trachomatis. The role of the 

immune defense in the physiology and pathology of the disease should probably be understood in 
order to develop satisfactory vaccines. 

More detailed information relating to the biology of these strains, their interactions 
with their hosts, the associated phenomena of infectivity and those of escaping the immune defenses 

30 of the host in particular, and finally their involvement in the development of the these associated 
pathologies, will allow a better understanding of these mechanisms. In the light of the preceding text 
which shows in particular the limitations of the means of controlling Chlamydia trachomatis 
infection, it is therefore at present essential, on the one hand, to develop molecular tools, in particular 
from a better genetic knowledge of Chlamydia trachomatis, but also to develop new preventive and 

35 therapeutic treatments, new diagnostic methods and new vaccine strategies which are specific, 
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effective and tolerated. This is precisely the object of the present invention. 

The subject of the present invention is the nucleotide sequence having the sequence 
SEQ ID No. 1 of the Chlamydia trachomatis LGV2 genome. However, the invention is not limited to 
SEQ ID No. 1, but encompasses genomes and nucleotides encoding polypeptides of strain variants, 
5 polymorphisms, allelic variants, and mutants. 

Thus, the subject of the present invention encompasses nucleotide sequences 
characterized in that they are chosen from: 

a) the nucleotide sequence of SEQ ID No. 1, a nucleotide sequence exhibiting at least 
99.9% identity with the sequence SEQ ID No. 1 , the nucleotide sequence of the genomic DNA 

10 contained within ECACC Deposit No. 98112618, the nucleotide sequence of a clone insert within 
ECACC Deposit No. 981 12617 (these being provisional deposit numbers); 

b) a nucleotide sequence homologous to the sequence SEQ ID No. 1 ; 

c) a polynucleotide sequence that hybridizes to the nucleotide sequence of a) under 
conditions of high or intermediate stringency as described below: 

15 (i) By way of example and not limitation, procedures using conditions of high stringency are 

as follows: Prehybridization of filters containing DNA is carried out for 8 h to overnight at 65°C in 
buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 
0.02% BSA, and 500 u;g/ml denatured salmon sperm DNA. Filters are hybridized for 48 h at 65°C, 
the preferred hybridization temperature, in prehybridization mixture containing 100 |ig/ml denatured 

20 salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Alternatively, the hybridization step 
can be performed at 65°C in the presence of SSC buffer, 1 x SSC corresponding to 0.1 5M NaCl and 
0.05 M Na citrate. Subsequently, filter washes can be done at 37°C for 1 h in a solution containing 
2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X SSC at 50°C for 45 
min. Alternatively, filter washes can be performed in a solution containing 2 x SSC and 0.1% SDS, 

25 or 0.5 x SSC and 0. 1 % SDS, or 0. 1 x SSC and 0. 1 % SDS at 68°C for 15 minute intervals. Following 
the wash steps, the hybridized probes are detectable by autoradiography. Other conditions of high 
stringency which may be used are well known in the art and as cited in Sambrook et al., 1989, 
Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47- 
9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green Publishing Associates 

30 and Wiley Interscience, N.Y. are incorporated herein in their entirety. 

(ii) By way of example and not limitation, procedures using conditions of intermediate 
stringency are as follows: Filters containing DNA are prehybridized, and then hybridized at a 
temperature of 60°C in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50°C and the hybridized probes are 

35 detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
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well known in the art and as cited in Sambrook et al., 1989, Molecular Cloning, A Laboratory 
Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, 
Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. 
are incorporated herein in their entirety. 
5 d) a nucleotide sequence complementary to the sequence SEQ ID No. 1 or 

complementary to a nucleotide sequence as defined in a), b) or c), and a nucleotide sequence of their 
corresponding RNA; 

e) a nucleotide sequence of a representative fragment of the sequence SEQ ID No. 1, or 
of a representative fragment of the nucleotide sequence as defined in a), b), c) or d); 
10 f) a nucleotide sequence comprising a sequence as defined in a), b), c), d) or e); 

g) a nucleotide sequence capable of being obtained from a nucleotide sequence as 
defined in a), b), c), d), e) or f); and 

h) a modified nucleotide sequence of a nucleotide sequence as defined in a), b), c), d), 
e),f)org). 

15 Sequence of the genome, or genomic sequence of Chlamydia trachomatis is 

understood to mean the sequence of the chromosome of Chlamydia trachomatis, in contrast with the 
plasmid sequence of Chlamydia trachomatis. 

Nucleotide sequence, polynucleotide or nucleic acid are understood to mean, 
according to the present invention, either a double-stranded DNA, a single-stranded DNA or products 

20 of transcription of the said DNAs. 

It should be understood that the present invention does not relate to the genomic 
nucleotide sequences of Chlamydia trachomatis taken in their natural environment, that is to say in 
the natural state. They are sequences which may have been isolated, purified or partially purified, by 
separation methods such as, for example, ion-exchange chromatography, molecular size exclusion 

25 chromatography or affinity chromatography, or alternatively fractionation techniques based on 
solubility in various solvents, or by genetic engineering methods such as amplification, cloning or 
subcloning, it being possible for the sequences of the invention to be carried by vectors. 

The nucleotide sequence SEQ ID No. 1 was obtained by sequencing the Chlamydia 
trachomatis LGV2 genome by the method of directed sequencing after fluorescent automated 

30 sequencing of the inserts of clones and assembling of these sequences of nucleotide fragments 
(inserts) by means of softwares (cf. Examples). In spite of the high precision of the sequence 
SEQ ID No. 1, it is possible that it does not perfectly, 100% represent the nucleotide sequence of the 
Chlamydia trachomatis LGV2 genome and that a few rare sequencing errors or uncertainties still 
remain in the sequence SEQ ID No. 1. In the present invention, the presence of an uncertainty for an 

35 amino acid is designated by «Xaa» and that for a nucleotide is designated by «N» in the sequence 
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listing below. These few rare errors or uncertainties could be easily detected and corrected by persons 
skilled in the art using the entire chromosome and/or its representative fragments according to the 
invention and standard amplification, cloning and sequencing methods, it being possible for the 
sequences obtained to be easily compared, in particular by means of a computer software and using 
5 computer-readable media for recording the sequences according to the invention as described, for 
example, below. After correcting these possible rare errors or uncertainties, the corrected nucleotide 
sequence obtained would still exhibit at least 99.9% identity with the sequence SEQ ID No. 1. Such 
rare sequencing uncertainties are not present within the DNA contained within ECACC Deposit No. 
98112617 or 98112618 (provisional numbers) and whatever rare sequence uncertainties that exist 

10 within SEQ ID No. 1 can routinely be corrected utilizing the DNA of the ECACC Deposits. 

Homologous nucleotide sequence for the purposes of the present invention is 
understood to mean a nucleotide sequence having a percentage identity with the bases of the 
nucleotide sequence SEQ ID No. 1 of at least 80%, preferably 90% and 95%, this percentage being 
purely statistical and it being possible for the differences between the two nucleotide sequences to be 

15 distributed randomly and over their entire length. The said homologous sequences exhibiting a 
percentage identity with the bases of the nucleotide sequence SEQ ID No. 1 of at least 80%, 
preferably 90% and 95%, may comprise, for example, the sequences corresponding to the genomic 
sequence or to the sequences of its representative fragments of a bacterium belonging to the 
Chlamydia family, including the species Chlamydia pneumoniae, Chlamydia psittaci and Chlamydia 

20 pecorum mentioned above, as well as the sequences corresponding to the genomic sequence or to the 
sequences of its representative fragments of a bacterium belonging to the variants of the species 
Chlamydia trachomatis. In the present invention, the terms family and genus are mutually 
interchangeable, the terms variant, serotype, strain and subspecies are also mutually interchangeable. 
These homologous sequences may thus correspond to variations linked to mutations within the same 

25 species or between species and may correspond in particular to truncations, substitutions, deletions 
and/or additions of at least one nucleotide. The said homologous sequences may also correspond to 
variations linked to the degeneracy of the genetic code or to a bias in the genetic code which is 
specific to the family, to the species or to the variant and which are likely to be present in Chlamydia. 

Protein and/or nucleic acid sequence homologies may be evaluated using any of the 

30 variety of sequence comparison algorithms and programs known in the art. Such algorithms and 
programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and 
CLUSTALW (Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85(8): 2444-2448; Altschul et 
al, 1990, J. Mol. Biol. 2i5(.?):403-410; Thompson et al, 1994, Nucleic Acids Res. 22^:4673-4680; 
Higgins et al, 1996, Methods Enzymol. 266:383-402; Altschul et al, 1990, J. Mol. Biol. 2]5(3):403- 

35 410; Altschul et al, 1993, Nature Genetics 3:266-272). 
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In a particularly preferred embodiment, protein and nucleic acid sequence homologies 
are evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well known in the 
art (see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA £7:2267-2268; Altschul et al, 
1990, J. Mol. Biol. 275:403-410; Altschul et al, 1993, Nature Genetics 3:266-272; Altschul et al, 
5 1997, Nuc. Acids Res. 25:3389-3402). In particular, five specific BLAST programs are used to 
perform the following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein sequence 
database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence database; 
1° (3)BLASTX compares the six-frame conceptual translation products of a query nucleotide 

sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database 
translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against the 
15 six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar segments, which are 
referred to herein as "high-scoring segment pairs," between a query amino or nucleic acid sequence 
and a test sequence which is preferably obtained from a protein or nucleic acid sequence database. 
High-scoring segment pairs are preferably identified (i.e., aligned) by means of a scoring matrix, 

20 many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 matrix 
(Gonnetera/., 1992, Science 256:1443-1445; Henikoff and Henikoff, 1993, Proteins 77:49-61). Less 
preferably, the PAM or PAM250 matrices may also be used (see, e.g., Schwartz and Dayhoff, eds., 
1978, Matrices for Detecting Distance Relationships: Atlas of Protein Sequence and Structure, 
Washington: National Biomedical Research Foundation) 

25 The BLAST programs evaluate the statistical significance of all high-scoring segment 

pairs identified, and preferably selects those segments which satisfy a user-specified threshold of 
significance, such as a user-specified percent homology. Preferably, the statistical significance of a 
high-scoring segment pair is evaluated using the statistical significance formula of Karlin (see, e.g., 
Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 57:2267-2268). 

30 Nucleotide sequence complementary to a sequence of the invention is understood to 

mean any DNA whose nucleotides are complementary to those of the sequence of the invention, and 
whose orientation is reversed (antiparallel sequence). 

The present invention further comprises fragments of the sequences of a) through h) 
above. Representative fragments of the sequences according to the invention will be understood to 

35 mean any nucleotide fragment having at least 8 successive nucleotides, preferably at least 12 
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successive nucleotides, and still more preferably at least 15 or at least 20 successive nucleotides of the 
sequence from which it is derived. It is understood that such fragments refer only to portions of SEQ 
ID No. 1 that are not currently listed in a publicly available database. 

Among these representative fragments, those capable of hybridizing under stringent 
5 conditions with a nucleotide sequence according to the invention are preferred. Hybridization under 
stringent conditions means that the temperature and ionic strength conditions are chosen such that 
they allow hybridization to be maintained between two complementary DNA fragments. 

By way of illustration, high stringency conditions for the hybridization step for the 
purposes of defining the nucleotide fragments described above, are advantageously the following. 
10 The hybridization is carried out at a preferred temperature of 65°C in the presence of 

SSC buffer, 1 x SSC corresponding to 0.15 M NaCl and 0.05 M Na citrate. The washing steps may be, 
for example, the following: 

2xSSC, 0.1% SDS at room temperature followed by three washes with 1 x SSC, 0.1% SDS; 
0.5 x SSC, 0.1% SDS; 0.1 x SSC, 0.1% SDS at 68°C for 15 minutes. 

15 Intermediate stringency conditions, using, for example, a temperature of 60°C in the 

presence of a 5 x SSC buffer, or of low stringency, for example a temperature of 50°C in the presence 
of a 5 x SSC buffer, respectively require a lower overall complementarity for the hybridization 
between the two sequences. 

The stringent hybridization conditions described above for a polynucleotide of about 

20 300 bases in size will be adapted by persons skilled in the art for larger- or smaller-sized 
oligonucleotides, according to the teaching of Sambrook et al., 1989. 

Among the representative fragments according to the invention, those which can be 
used as primer or probe in methods which make it possible to obtain homologous sequences or their 
representative fragments according to the invention, or to reconstitute a genomic fragment found to be 

25 incomplete in the sequence SEQ ID No. 1 or carrying an error or an uncertainty, are also preferred, 
these methods, such as the polymerase chain reaction (PCR), cloning and sequencing of nucleic acid 
being well known to persons skilled in the art. These homologous nucleotide sequences corresponding 
to mutations or to inter- or intra-species variations, as well as the complete genomic sequence or one 
of its representative fragments capable of being reconstituted, of course form part of the invention. 

30 Among the said representative fragments, those which can be used as primer or probe 

in methods allowing diagnosis of the presence of Chlamydia trachomatis or one of its associated 
microorganisms as defined below are also preferred. 

The representative fragments capable of modulating, regulating, inhibiting or 
inducing the expression of a gene of Chlamydia trachomatis or one of its associated microorganisms, 

35 and/or capable of modulating the replication cycle of Chlamydia trachomatis or one of its associated 
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microorganisms in the host cell and/or organism, are also preferred. Replication cycle is intended to 
designate invasion, multiplication, intracellular localization, in particular retention in the vacuole and 
inhibition of the process of fusion to the lysosome, and propagation of Chlamydia trachomatis or one 
of its associated microorganisms from host cells to host cells. 
5 Among the said representative fragments, those corresponding to nucleotide 

sequences corresponding to open reading frames, called ORF sequences (ORF for open reading 
frame), and encoding polypeptides, such as for example, but without being limited thereto, the ORF 
sequences which will be later described, are finally preferred. 

The representative fragments according to the invention may be obtained, for 

10 example, by specific amplification, such as PCR, or after digestion, with appropriate restriction 
enzymes, of nucleotide sequences according to the invention; these methods are in particular 
described in the manual by Sambrook et al, 1989. The said representative fragments may also be 
obtained by chemical synthesis when they are not too large in size and according to methods well 
known to persons skilled in the art. For example, such fragments can be obtained by isolating 

15 fragments of the genomic DNA of ECACC Deposit No. 98112618 or a clone insert present at this 
ECACC Deposit No. 981 12617 (provisional numbers). 

The representative fragments according to the invention may be used, for example, as 
primer, to reconstitute some of the said representative fragments, in particular those in which a 
portion of the sequence is likely to be missing or imperfect, by methods well known to persons skilled 

20 in the art such as amplification, cloning or sequencing techniques. 

Modified nucleotide sequence will be understood to mean any nucleotide sequence 
obtained by mutagenesis according to techniques well known to persons skilled in the art, and 
exhibiting modifications in relation to the normal sequences, for example mutations in the regulatory 
and/or promoter sequences for the expression of a polypeptide, in particular leading to a modification 

25 of the level of expression of the said polypeptide or to a modulation of the replicative cycle. 

Modified nucleotide sequence will also be understood to mean any nucleotide 
sequence encoding a modified polypeptide as defined below. 

The subject of the present invention also includes Chlamydia trachomatis nucleotide 
sequences characterized in that they are chosen from a nucleotide sequence of an open reading frame 

30 (ORF),thatis,theORF2toORF1197 sequences. 

The ORF2 to ORF1197 nucleotide sequences are defined in Tables 1 and 2, infra, 
represented below by their position on the sequence SEQ ID No. 1. For example, the ORF 10 
sequence is defined by the nucleotide sequence between the nucleotides at position 9828 and 10430 
on the sequence SEQ ID No. 1, ends included. ORF2 to ORF1197 have been identified via 

35 homology analyses as well as via analyses of potential ORF start sites, as discussed in the examples 
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below. It is to be understood that each identified ORF of the invention comprises a nucleotide 
sequence that spans the contiguous nucleotide sequence from the codon immediately 3' to the stop 
codon of the preceding ORF and through the 5' codon to the next stop codon of SEQ ID No.:l in- 
frame to the ORF nucleotide sequence. Table 2, infra, lists the beginning, end and potential start site 
5 of each of ORFs 2-1 197. In one embodiment, the ORF comprises the contiguous nucleotide sequence 
spanning from the potential ORF start site downstream (that is, 3') to the ORF stop codon (or the ORF 
codon immediately adjacent to and upstream of the ORF stop codon). ORF2 to ORF1 197 encode the 
polypeptides of SEQ ID No. 2 to SEQ ID No. 1 197. 

Upon introduction of minor frameshifts, certain individual ORFs can comprise larger 

10 «combined» ORFs. A list of such putative «combined» ORFs is shown in Table 3, below. For 
example, a combined ORF can comprise ORF 1076 and ORF 1073, including intervening in-frame, 
nucleotide sequences. The order of ORFs (5' to 3'), within each «combined» ORF is as listed. It is to 
be understood that when ORF2 to ORF 1197 are referred to herein, such reference is also meant to 
include «combined» ORFs. Polypeptide sequences encoded by such «combined» ORFs are also part 

1 5 of the present invention . 
Table 3 

ORF 1076, ORF 1073; 

ORF 3, ORF 2; 

ORF 23, ORF 22, ORF 21; 
20 ORF 1141, ORF 477, ORF 478, ORF 479; 

ORF 487, ORF 486, ORF 485, ORF 484, ORF 483, ORF 482, ORF 481 ; 

ORF 488, ORF 489; 

ORF 573, ORF 572, ORF 571 ; 

ORF 817, ORF 818; 
25 ORF 819, ORF 820; 

ORF 1037, ORF 1038; 

ORF 1071, ORF 1070; 

ORF 17, ORF 1077; 

ORF 1 185, ORF 933, ORF 934; 
30 ORF 1060, ORF 1059; 

ORF 155, ORF 156; 

ORF 679, ORF 680; 

ORF 879, ORF 878; 

ORF 1028; ORF 1029, 
35 and representative fragments. 
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Table 1 also depicts the results of homology searches that compared the sequences of 
the polypeptides encoded by each of the ORFs to sequences present in public published databases. It 
is understood that in one embodiment, those polypeptides listed in Table 1 as exhibiting greater than 
about 95% identity to a polypeptide present in a publicly disclosed database are not considered part of 
5 the present invention; likewise in this embodiment, those nucleotide sequences encoding such 
polypeptides are not considered part of the invention. In another embodiment, it is understood that 
those polypeptides listed in Table 1 as exhibiting greater than about 99% identity to a polypeptide 
present in a publicly disclosed database are not considered part of the invention; likewise, in this 
embodiment, those nucleotide sequences encoding such polypeptides are not considered part of the 
10 invention. 

The invention also relates to the nucleotide sequences characterized in that they 
comprise a nucleotide sequence chosen from: 

a) an ORF2 to ORF1197 , a «combined» ORF nucleotide sequence, the nucleotide 
sequence of the genomic DNA contained within ECACC Deposit No. 98112618 or the nucleotide 

15 sequence of a clone insert in ECACC Deposit No. 981 12617 according to the invention; 

b) a homologous nucleotide sequence exhibiting at least 80% identity across an entire 
ORF2 to ORF 1 197 nucleotide sequence according to the invention or as defined in a); 

c) a polynucleotide sequence that hybridizes to ORF2 to ORF1 197 under conditions of 
high or intermediate stringency as described below: 

20 (0 B y wa y of example and not limitation, procedures using conditions of high 

stringency are as follows: Prehybridization of filters containing DNA is carried out for 8 h to 
overnight at 65°C in buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.02% BSA, and 500 u,g/ml denatured salmon sperm DNA. Filters are hybridized 
for 48 h at 65°C, the preferred hybridization temperature, in prehybridization mixture containing 100 

25 ug/ml denatured salmon sperm DNA and 5-20 X 10 fi cpm of 32 P-labeled probe. Alternatively, the 
hybridization step can be performed at 65°C in the presence of SSC buffer, 1 x SSC corresponding to 
0.1 5M NaCl and 0.05 M Na citrate. Subsequently, filter washes can be done at 37°C for 1 h in a 
solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X 
SSC at 50°C for 45 min. Alternatively, filter washes can be performed in a solution containing 2 x 

30 SSC and 0.1% SDS, or 0.5 x SSC and 0.1% SDS, or 0.1 x SSC and 0.1% SDS at 68°C for 15 minute 
intervals. Following the wash steps, the hybridized probes are detectable by autoradiography. Other 
conditions of high stringency which may be used are well known in the art and as cited in Sambrook 
et al., 1989, Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, 
N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green 

35 Publishing Associates and Wiley Interscience, N.Y. are incorporated herein in their entirety. 
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Preferably, such sequences encode a homolog of a polypeptide encoded by one of ORF2 to ORF1 197. 
In one embodiment, such sequences encode a Chlamydia trachomatis polypeptide. 

(ii) By way of example and not limitation, procedures using conditions of 
intermediate stringency are as follows: Filters containing DNA are prehybridized, and then hybridized 
5 at a temperature of 60°C in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50°C and the hybridized probes are 
detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
well known in the art and as cited in Sambrook et al., 1989, Molecular Cloning, A Laboratory 
Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, 
10 Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. 
are incorporated herein in their entirety. Preferably, such sequences encode a homolog of a 
polypeptide encoded by one of ORF2 to ORF1197 . In one embodiment, such sequences encode a 
Chlamydia trachomatis polypeptide. 

d) a complementary or RNA nucleotide sequence corresponding to an ORF2 to 
15 ORF1 197 sequence according to the invention or as defined in a), b) or c); 

e) a nucleotide sequence of a representative fragment of an ORF2 to ORF1197 
sequence according to the invention or of a sequence as defined in a), b), c) or d); 

f) a nucleotide sequence capable of being obtained from an ORF2 to ORF1197 
sequence according to the invention or as defined in a), b), c), d) or e); and 

20 g) a modified nucleotide sequence of an ORF2 to ORF1 197 sequence according to the 

invention or as defined in a), b), c), d), e) or f). 

As regards the homology with the ORF2 to ORF1197 nucleotide sequences, the 
homologous sequences exhibiting a percentage identity with the bases of one of the ORF2 to 
ORF1197 nucleotide sequences of at least 80%, preferably 90% and 95%, are preferred. Such 

25 homologous sequences are identified routinely via, for example, the algorithms described above and 
in the examples below. The said homologous sequences correspond to the homologous sequences as 
defined above and may comprise, for example, the sequences corresponding to the ORF sequences of 
a bacterium belonging to the Chlamydia family, including the species Chlamydia pneumoniae, 
Chlamydia psittaci and Chlamydia pecorum mentioned above, as well as the sequences corresponding 

30 to the ORF sequences of a bacterium belonging to the variants of the species Chlamydia trachomatis. 
These homologous sequences may likewise correspond to variations linked to mutations within the 
same species or between species and may correspond in particular to truncations, substitutions, 
deletions and/or additions of at least one nucleotide. The said homologous sequences may also 
correspond to variations linked to the degeneracy of the genetic code or to a bias in the genetic code 

35 which is specific to the family, to the species or to the variant and which are likely to be present in 
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Chlamydia. 

The invention comprises the polypeptides encoded by a nucleotide sequence 
according to the invention, preferably by a representative fragment of the sequence SEQ ID No. 1 and 
corresponding to an ORF sequence, in particular the Chlamydia trachomatis polypeptides, 
5 characterized in that they are chosen from the sequences SEQ ID No. 2 to SEQ ID No. 1 197 , 

and representative fragments thereof. However, the invention is not limited to polypeptides encoded 
by ORFs in SEQ ID No. 1 and its corresponding ORF sequences, but encompasses polypeptides of 
strain variants, polymorphisms, allelic variants, and mutants. 

Thus, the invention also comprises the polypeptides characterized in that they 
1 0 comprise a polypeptide chosen from: 

a) a polypeptide encoded by a polynucleotide sequence in SEQ ID No. 1 {e.g., any 
polypeptide encoded by a polynucleotide sequence corresponding to ORF2 to ORF1197) and/or 
representative fragments thereof according to the invention; 

b) a polypeptide homologous to a polypeptide according to the invention, or as defined 

15 in a); 

c) a polypeptide encoded by a polynucleotide sequence that hybridizes to SEQ ID No. 1 
or ORF2 to ORF1 197 under high or intermediate stringency as described below: 

(i) By way of example and not limitation, procedures using conditions of high 
stringency are as follows: Prehybridization of filters containing DNA is carried out for 8 h to 

20 overnight at 65°C in buffer composed of 6X SSC, 50 mM Tris-HCI (pH 7.5), 1 niM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.02% BSA, and 500 jig/ml denatured salmon sperm DNA. Filters are hybridized 
for 48 h at 65°C, the preferred hybridization temperature, in prehybridization mixture containing 100 
^g/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Alternatively, the 
hybridization step can be performed at 65°C in the presence of SSC buffer, 1 x SSC corresponding to 

25 0.15M NaCl and 0.05 M Na citrate. Subsequently, filter washes can be done at 37°C for 1 h in a 
solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X 
SSC at 50°C for 45 min. Alternatively, filter washes can be performed in a solution containing 2 x 
SSC and 0.1% SDS, or 0.5 x SSC and 0.1% SDS, or 0.1 x SSC and 0.1% SDS at 68°C for 15 minute 
intervals. Following the wash steps, the hybridized probes are detectable by autoradiography. Other 

30 conditions of high stringency which may be used are well known in the art and as cited in Sambrook 
et al., 1989, Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, 
N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green 
Publishing Associates and Wiley Interscience, N.Y. are incorporated herein in their entirety. 
Preferably, such sequences encode a homolog of a polypeptide encoded by one of ORF2 to ORF1 197 

35 . In one embodiment, such sequences encode a Chlamydia trachomatis polypeptide. 
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(ii) By way of example and not limitation, procedures using conditions of 
intermediate stringency are as follows: Filters containing DNA are prehybridized, and then hybridized 
at a temperature of 60°C in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50°C and the hybridized probes are 
5 detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
well known in the art and as cited in Sambrook et al., 1989, Molecular Cloning, A Laboratory 
Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, 
Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. 
are incorporated herein in their entirety. Preferably, such sequences encode a homolog of a 
10 polypeptide encoded by one of ORF2 to ORF1197 . In one embodiment, such sequences encode a 
Chlamydia trachomatis polypeptide. 

d) a fragment of at least 5 amino acids of a polypeptide according to the invention, or as 
defined in a), b) or c); 

e) a biologically active fragment of a polypeptide according to the invention, or as 
15 defined in a), b), c) or d); and 

f) a modified polypeptide of a polypeptide according to the invention, as defined in a), 
b), c), d) or e). 

In the present description, the terms polypeptide, peptide and protein are 

interchangeable. 

20 It should be understood that the invention does not relate to the polypeptides in 

natural form, that is to say that they are not taken in their natural environment but that they may have 
been isolated or obtained by purification from natural sources, or alternatively obtained by genetic 
recombination, or else by chemical synthesis and that they may, in this case, comprise nonnatural 
amino acids, as will be described below. 

25 Homologous polypeptide will be understood to designate the polypeptides exhibiting, 

in relation to the natural polypeptide, certain modifications such as in particular a deletion, addition or 
substitution of at least one amino acid, a truncation, an extension, a chimeric fusion, and/or a 
mutation, or polypeptides exhibiting post-translational modifications. Among the homologous 
polypeptides, those whose amino acid sequence exhibits at least 80%, preferably 90%, homology or 

30 identity with the amino acid sequences of the polypeptides according to the invention are preferred. In 
the case of a substitution, one or more consecutive or nonconsecutive amino acids are replaced by 
«equivalent» amino acids. The expression «equivalent» amino acid is intended here to designate any 
amino acid capable of being substituted for one of the amino acids in the basic structure without 
however, essentially modifying the biological activities of the corresponding peptides and as will be 

35 defined later. 
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Protein and/or nucleic acid sequence homologies may be evaluated using any of the 
variety of sequence comparison algorithms and programs known in the art. Such algorithms and 
programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and 
CLUSTALW (Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85(8): 2444-2448; Altschul et 
5 al, 1990, J. Mol. Biol. 275^:403-410; Thompson et al, 1994, Nucleic Acids Res. 22(2):4673-4680; 
Higgins et al, 1996, Methods Enzymol. 266:383-402; Altschul et al., 1990, J. Mol. Biol. 2 15(3 ):403- 
410; Altschul et al, 1993, Nature Genetics 3:266-272). 

In a particularly preferred embodiment, protein and nucleic acid sequence homologies 
are evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well know in the art 
10 (see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 57:2267-2268; Altschul et al., 1990, 
J. Mol. Biol. 275:403-410; Altschul et al, 1993, Nature Genetics 3:266-272; Altschul et al, 1997, 
Nuc. Acids Res. 25:3389-3402). In particular, five specific BLAST programs are used to perform the 
following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein sequence 
15 database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence database; 

(3) BLASTX compares the six-frame conceptual translation products of a query nucleotide 
sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database 
20 translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against the 
six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar segments, which are 
referred to herein as "high-scoring segment pairs," between a query amino or nucleic acid sequence 

25 and a test sequence which is preferably obtained from a protein or nucleic acid sequence database. 
High-scoring segment pairs are preferably identified (i.e., aligned) by means of a scoring matrix, 
many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 matrix 
(Gonnet etal, 1992, Science 256:1443-1445; Henikoff and Henikoff, 1993, Proteins 77:49-61). Less 
preferably, the PAM or PAM250 matrices may also be used (see, e.g., Schwartz and Dayhoff, eds., 

30 1978, Matrices for Detecting Distance Relationships: Atlas of Protein Sequence and Structure, 
Washington: National Biomedical Research Foundation) 

The BLAST programs evaluate the statistical significance of all high-scoring segment 
pairs identified, and preferably selects those segments which satisfy a user-specified threshold of 
significance, such as a user-specified percent homology. Preferably, the statistical significance of a 

35 high-scoring segment pair is evaluated using the statistical significance formula of Karlin (see, e.g., 
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Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 57:2267-2268). 

Equivalent amino acids may be determined either based on their structural homology 
with the amino acids for which they are substituted, or on results of comparative tests of biological 
activity between the various polypeptides which may be carried out. 
5 By way of example, there may be mentioned the possibilities of substitutions which 

may be carried out without resulting in a substantial modification of the biological activity of the 
corresponding modified polypeptides; the replacements, for example, of leucine with valine or 
isoleucine, of aspartic acid with glutamic acid, of glutamine with asparagine, of arginine with lysine, 
and the like, the reverse substitutions naturally being feasible under the same conditions. 
10 The homologous polypeptides also correspond to the polypeptides encoded by the 

homologous nucleotide sequences as defined above and thus comprise in the present definition the 
mutated polypeptides or polypeptides corresponding to inter- or intra-species variations which may 
exist in Chlamydia, and which correspond in particular to truncations, substitutions, deletions and/or 
additions of at least one amino acid residue. 
15 Biologically active fragment of a polypeptide according to the invention will be 

understood to designate in particular a polypeptide fragment, as defined below, exhibiting at least one 
of the characteristics of the polypeptides according to the invention, in particular in that it is: 

capable of eliciting an immune response directed against Chlamydia trachomatis; and/or 
capable of being recognized by an antibody specific for a polypeptide according to the 
20 invention; and/or 

capable of binding to a polypeptide or to a nucleotide sequence of Chlamydia trachomatis; 

and/or 

capable of modulating, regulating, inducing or inhibiting the expression of a gene of 
Chlamydia trachomatis or one of its associated microorganisms, and/or capable of modulating the 

25 replication cycle of Chlamydia trachomatis or one of its associated microorganisms in the host cell 
and/or organism; and/or 

capable of generally exerting an even partial physiological activity, such as for example a 
structural activity (cellular envelope, ribosome), an enzymatic (metabolic) activity, a transport 
activity, an activity in the secretion or in the virulence. 

30 A representative polypeptide fragment according to the invention is understood to 

designate a polypeptide comprising a minimum of 5 amino acids, preferably 10 amino acids or 
preferably 15 amino acids. It is to be understood that such fragments refer only to portions of 
polypeptides encoded by ORF2 or ORF1197 that are not currently listed in a publicly available 
database. 

35 The polypeptide fragments according to the invention may correspond to isolated or 
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purified fragments which are naturally present in Chlamydia trachomatis or which are secreted by 
Chlamydia trachomatis, or may correspond to fragments capable of being obtained by cleaving the 
said polypeptide with a proteolytic enzyme, such as trypsin or chymotrypsin or collagenase, or with a 
chemical reagent, such as cyanogen bromide (CNBr) or alternatively by placing the said polypeptide 
5 in a highly acidic environment, for example at pH 2.5. Such polypeptide fragments may be equally 
well prepared by chemical synthesis, using hosts transformed with an expression vector according to 
the invention containing a nucleic acid allowing the expression of the said fragments, placed under 
the control of appropriate elements for regulation and/or expression. 

«Modified polypeptide» of a polypeptide according to the invention is understood to 

10 designate a polypeptide obtained by genetic recombination or by chemical synthesis as will be 
described below, exhibiting at least one modification in relation to the normal sequence. These 
modifications may in particular affect amino acids responsible for a specificity or for the efficiency of 
the activity, or responsible for the structural conformation, for the charge or for the hydrophobicity, 
and for the capacity for multimerization and for membrane insertion of the polypeptide according to 

15 the invention. It is thus possible to create polypeptides with an equivalent, an increased or a reduced 
activity, and with an equivalent, a narrower or a broader specificity. Among the modified 
polypeptides, there may be mentioned the polypeptides in which up to 5 amino acids may be 
modified, truncated at the N- or C-terminal end, or alternatively deleted, or else added. 

As is indicated, the modifications of the polypeptide may have in particular the 

20 objective: 

of making it capable of modulating, regulating, inhibiting or inducing the expression of a gene 
of Chlamydia, in particular of Chlamydia trachomatis and its variants, or one of its associated micro- 
organisms, and/or capable of modulating the replication cycle of Chlamydia, in particular of 
Chlamydia trachomatis and its variants, or one of its associated microorganisms, in the host cell 
25 and/or organism, 

of allowing its use in methods of biosynthesis or of biodegradation, or its incorporation into 
vaccine compositions, 

of modifying its bioavailability as a compound for therapeutic use. 

The said modified polypeptides may also be used on any cell or microorganism for 
30 which the said modified polypeptides will be capable of modulating, regulating, inhibiting or inducing 
gene expression, or of modulating the growth or the replication cycle of the said cell or of the said 
microorganism. The methods allowing demonstration of the said modulations on eukaryotic or 
prokaryotic cells are well known to persons skilled in the art. The said cells or microorganisms will be 
chosen, in particular, from tumour cells or infectious microorganisms and the said modified 
35 polypeptides may be used for the prevention or treatment of pathologies linked to the presence of the 
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said cells or of the said microorganisms. It is also clearly understood that the nucleotide sequences 
encoding the said modified polypeptides may be used for the said modulations, for example by the 
intermediacy of vectors according to the invention and which are described below, so as to prevent or 
to treat the said pathologies. 
5 The above modified polypeptides may be obtained using combinatory chemistry, in 

which it is possible to systematically vary portions of the polypeptide before testing them on models, 
cell cultures or microorganisms for example, so as to select the compounds which are the most active 
or which exhibit the desired properties. 

Chemical synthesis also has the advantage of being able to use: 
10 - nonnatural amino acids, or 

- nonpeptide bonds. 

Accordingly, in order to extend the life of the polypeptides according to the 
invention, it may be advantageous to use nonnatural amino acids, for example in the D form, or 
alternatively amino acid analogues, in particular sulphur-containing forms for example. 

15 Finally, the structure of the polypeptides according to the invention, its homologous 

or modified forms, as well as the corresponding fragments may be integrated into chemical structures 
of the polypeptide type and the like. Accordingly, it may be advantageous to provide at the N- and 
C-terminal ends compounds which are not recognized by proteases. 

Also forming part of the invention are the nucleotide sequences encoding a 

20 polypeptide according to the invention. Described below are ORF nucleotide sequences encoding 
polypeptides exhibiting particularly preferable characteristics. For each group of preferred ORFs 
described below, it is to be understood that in addition to the individual ORFs listed, in instances 
wherein such ORFs are present as part of «combined» ORFs, the «combined» ORFs are also to be 
included within the preferred group. 

25 More particularly, the subject of the invention is nucleotide sequences, characterized 

in that they encode a polypeptide of the cellular envelope, preferably of the outer cellular envelope of 
Chlamydia trachomatis or one of its representative fragments, such as for example the predominant 
proteins of the outer membrane, the adhesion proteins or the proteins entering into the composition of 
the Chlamydia wall. Among these sequences, the sequences comprising a nucleotide sequence chosen 

30 from the following sequences are most preferred: 

ORF3; ORF19; ORF51; ORF189; ORF212; ORF213; ORF324; ORF477; ORF478; ORF479; 
ORF481; ORF482; ORF483; ORF484; ORF486; ORF488; ORF489; ORF490; ORF572; ORF573; 
ORF742; ORF817; ORF818; ORF820; ORF1035; ORF1036; ORF1037; ORF1038; ORF1070; 
ORF1071; ORF1073 and one of their representative fragments. 

35 The structure of the cytoplasmic membranes and of the wall of bacteria is dependent 
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on the associated proteins. The structure of the cytoplasmic membrane makes it impermeable to 
water, to water-soluble substances and to small-sized molecules (ions, small inorganic molecules, 
peptides or proteins). To enter into or to interfere with a cell or a bacterium, a ligand must establish a 
special relationship with a protein anchored in the cytoplasmic membrane (the receptor). These 
5 proteins which are anchored on the membrane play an important role in metabolism since they control 
the exchanges in the bacterium. These exchanges apply to molecules of interest for the bacterium 
(small molecules such as sugars and small peptides) as well as undesirable molecules for the 
bacterium such as antibiotics or heavy metals. 

The double lipid layer structure of the membrane requires the proteins which are 

10 inserted therein to have hydrophobic domains of about twenty amino acids forming an alpha helix. 
Predominantly hydrophobic and potentially transmembrane regions may be predicted from the 
primary sequence of the proteins, itself deduced from the nucleotide sequence. The presence of one or 
more putative transmembrane domains raises the possibility for a protein to be associated with the 
cytoplasmic membrane and to be able to play an important metabolic role therein or alternatively for 

15 the protein thus exposed to be able to exhibit potentially protective epitopes. 

If the proteins inserted into the membrane exhibit several transmembrane domains 
capable of interacting with one another via electrostatic bonds, it then becomes possible for these 
proteins to form pores which go across the membrane which becomes permeable for a number of 
substances. It should be noted that proteins which do not have transmembrane domains may also be 

20 anchored by the intermediacy of fatty acids in the cytoplasmic membrane, it being possible for the 
breaking of the bond between the protein and its anchor in some cases to be responsible for the 
release of the peptide outside the bacterium. 

Preferably, the invention relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis transmembrane polypeptide or 

25 one of its representative fragments, having between 1 and 3 transmembrane domains and in that they 
comprise a nucleotide sequence chosen from the following sequences: 

ORF2; ORF3; ORF5; ORF8; ORF9; ORF10; ORF11; ORF12; ORF17; ORF21; ORF26; ORF27; 

ORF28; ORF29; ORF30; ORF31; ORF33; ORF35; ORF37; ORF39; ORF40; ORF41; ORF42; 

ORF43; ORF44; ORF45; ORF46; ORF47; ORF48; ORF49; ORF52; ORF53; ORF55; ORF56; 
30 ORF58; ORF65; ORF66; ORF68; ORF70; ORF74; ORF75; ORF76; ORF78; ORF79; ORF81; 

ORF82; ORF83; ORF86; ORF91; ORF92; ORF94; ORF97; ORF100; ORF102; ORF103; ORF105; 

ORF106; ORF107; ORF109; ORF110; ORF111; ORF112; ORF113; ORF114; ORF115; ORF116; 

ORF117; ORF120; ORF122; ORF123; ORF130; ORF134; ORF135; ORF137; ORF140; ORF141; 

ORF143; ORF144; ORF145; ORF147; ORF148; ORF149; ORF150; ORF151; ORF155; ORF156; 
35 ORF162; ORF163; ORF164; ORF165; ORF166; ORF167; ORF168; ORF169; ORF170; ORF171; 
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ORF173; ORF175; ORF176; ORF177; ORF181; ORF183; ORF184; ORF186; ORF187; ORF188; 

ORF190; ORF191; ORF192; ORF194; ORF195; ORF196; ORF197; ORF198; ORF199; ORF201; 

ORF202; ORF204; ORF206; ORF207; ORF209; ORF212; ORF213; ORF217; ORF219; ORF220; 

ORF221; ORF222; ORF223; ORF224; ORF225; ORF227; ORF228; ORF231; ORF232; ORF234; 
5 ORF236; ORF237; ORF243; ORF244; ORF245; ORF247; ORF248; ORF249; ORF252; ORF254; 

ORF257; ORF260; ORF261; ORF263; ORF265; ORF266; ORF267; ORF270; ORF271; ORF272; 

ORF274; ORF276; ORF277; ORF278; ORF279; ORF282; ORF283; ORF284; ORF285; ORF287; 

ORF289; ORF290; ORF291; ORF294; ORF298; ORF305; ORF306; ORF310; ORF311; ORF313; 

ORF315; ORF316; ORF319; ORF320; ORF322; ORF323; ORF325; ORF326; ORF327; ORF328; 
10 ORF330; ORF331; ORF332; ORF333; ORF334; ORF335; ORF336; ORF338; ORF339; ORF340; 

ORF341; ORF344; ORF345; ORF348; ORF349; ORF350; ORF351; ORF352; ORF353; ORF356; 

ORF357; ORF358; ORF361; ORF362; ORF366; ORF367; ORF368; ORF370; ORF372; ORF373; 

ORF375; ORF377; ORF378; ORF379; ORF380; ORF382; ORF383; ORF384; ORF385; ORF387; 

ORF389; ORF390; ORF391; ORF393; ORF396; ORF398; ORF399; ORF403; ORF404; ORF406; 
15 ORF407; ORF413; ORF414; ORF417; ORF418; ORF420; ORF421; ORF424; ORF426; ORF427; 

ORF428; ORF430; ORF433; ORF434; ORF435; ORF436; ORF437; ORF440; ORF443; ORF446; 

ORF448; ORF450; ORF451; ORF454; ORF455; ORF457; ORF458; ORF459; ORF463; ORF464; 

ORF466; ORF467; ORF468; ORF469; ORF470; ORF473; ORF474; ORF475; ORF476; ORF477; 

ORF479; ORF480; ORF481; ORF483; ORF484; ORF485; ORF486; ORF487; ORF488; ORF491; 
20 ORF493; ORF496; ORF497; ORF498; ORF500; ORF501; ORF503; ORF504; ORF508; ORF512; 

ORF513; ORF514; ORF519; ORF521; ORF523; ORF524; ORF526; ORF527; ORF529; ORF530; 

ORF531; ORF532; ORF534; ORF536; ORF537; ORF538; ORF540; ORF541; ORF542; ORF543; 

ORF544; ORF545; ORF546; ORF547; ORF551; ORF552; ORF553; ORF555; ORF558; ORF559; 

ORF560; ORF561; ORF562; ORF566; ORF567; ORF568; ORF569; ORF571; ORF572; ORF574; 
25 ORF575; ORF576; ORF580; ORF582; ORF585; ORF587; ORF589; ORF592; ORF593; ORF595; 

ORF596; ORF597; ORF599; ORF601; ORF602; ORF603; ORF604; ORF608; ORF609; ORF610; 

ORF611; ORF615; ORF616; ORF617; ORF618; ORF621; ORF622; ORF623; ORF624; ORF625; 

ORF628; ORF632; ORF633; ORF634; ORF635; ORF637; ORF638; ORF640; ORF641; ORF643; 

ORF646; ORF648; ORF649; ORF651; ORF652; ORF653; ORF654; ORF655; ORF658; ORF664; 
30 ORF665; ORF666; ORF668; ORF669; ORF670; ORF671; ORF672; ORF673; ORF674; ORF676; 

ORF677; ORF678; ORF680; ORF682; ORF683; ORF684; ORF686; ORF688; ORF689; ORF690; 

ORF691; ORF692; ORF693; ORF695; ORF696; ORF698; ORF701; ORF703; ORF704; ORF705; 

ORF706; ORF707; ORF709; ORF710; ORF711; ORF712; ORF713; ORF714; ORF715; ORF717; 

ORF718; ORF720; ORF721; ORF722; ORF724; ORF726; ORF728; ORF729; ORF730; ORF731; 
35 ORF732; ORF733; ORF734; ORF737; ORF738; ORF739; ORF740; ORF742; ORF743; ORF744; 
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ORF745; ORF746; ORF748; ORF750; ORF751; ORF752; ORF753; ORF754; ORF755; ORF757; 

ORF758; ORF759; ORF760; ORF764; ORF766; ORF768; ORF769; ORF771; ORF772; ORF773; 

ORF774; ORF775; ORF776; ORF777; ORF778; ORF779; ORF780; ORF781; ORF782; ORF783; 

ORF786; ORF787; ORF788; ORF789; ORF790; ORF793; ORF798; ORF800; ORF802; ORF803; 
5 ORF806; ORF808; ORF809; ORF810; ORF811; ORF813; ORF814; ORF817; ORF820; ORF822; 

ORF824; ORF825; ORF827; ORF828; ORF829; ORF830; ORF833; ORF834; ORF835; ORF837; 

ORF838; ORF839; ORF840; ORF841; ORF842; ORF843; ORF845; ORF848; ORF849; ORF850; 

ORF851; ORF852; ORF854; ORF855; ORF856; ORF857; ORF859; ORF860; ORF862; ORF863; 

ORF864; ORF866; ORF869; ORF872; ORF873; ORF874; ORF878; ORF879; ORF880; ORF881; 
10 ORF883; ORF884; ORF885; ORF886; ORF887; ORF892; ORF893; ORF894; ORF895; ORF897; 

ORF899; ORF900; ORF901; ORF904; ORF906; ORF909; ORF910; ORF912; ORF914; ORF917; 

ORF920; ORF921; ORF922; ORF923; ORF924; ORF925; ORF926; ORF927; ORF930; ORF933; 

ORF934; ORF935; ORF936; ORF937; ORF940; ORF941; ORF942; ORF943; ORF944; ORF945; 

ORF947; ORF948; ORF951; ORF952; ORF953; ORF954; ORF955; ORF956; ORF957; ORF958; 
15 ORF960; ORF961; ORF962; ORF963; ORF964; ORF966; ORF967; ORF969; ORF970; ORF971; 

ORF973; ORF974; ORF979; ORF980; ORF981; ORF982; ORF984; ORF988; ORF989; ORF990; 

ORF991; ORF995; ORF996; ORF999; ORF1001; ORF1003; ORF1004; ORF1005; ORF1006; 

ORF1007; ORF1009; ORF1010; ORF1011; ORF1012; ORF1013; ORF1014; ORF1016; ORF1017; 

ORF1018; ORF1020; ORF1021; ORF1025; ORF1026; ORF1027; ORF1029; ORF1030; ORF1031; 
20 ORF1035; ORF1036; ORF1037; ORF1038; ORF1039; ORF1040; ORF1044; ORF1045; ORF1047; 

ORF1048; ORF1050; ORF1051; ORF1052; ORF1053; ORF1055; ORF1056; ORF1057; ORF1058; 

ORF1061; ORF1062; ORF1063; ORF1064; ORF1065; ORF1066; ORF1068; ORF1069; ORF1072; 

ORF1074; ORF1076 and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
25 invention, characterized in that they encode a Chlamydia trachomatis transmembrane polypeptide or 

one of its representative fragments, having between 4 and 6 transmembrane domains and in that they 

comprise a nucleotide sequence chosen from the following sequences: 

ORF7; ORF14; ORF16; ORF32; ORF34; ORF36; ORF38; ORF50; ORF57; ORF59; ORF61; ORF62; 

ORF63; ORF64; ORF67; ORF69; ORF72; ORF77; ORF80; ORF84; ORF87; ORF93; ORF95; 
30 ORF99; ORF108; ORF119; ORF125; ORF126; ORF129; ORF131; ORF136; ORF139; ORF146; 

ORF152; ORF154; ORF160; ORF161; ORF172; ORF179; ORF182; ORF185; ORF200; ORF203; 

ORF205; ORF239; ORF242; ORF250; ORF253; ORF256; ORF259; ORF262; ORF268; ORF275; 

ORF281; ORF286; ORF288; ORF292; ORF295; ORF296; ORF297; ORF299; ORF300; ORF308; 

ORF314; ORF317; ORF318; ORF324; ORF342; ORF343; ORF355; ORF360; ORF374; ORF376; 
35 ORF386; ORF388; ORF392; ORF394; ORF395; ORF402; ORF405; ORF411; ORF415; ORF416; 
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ORF422; ORF423; ORF429; ORF432; ORF441; ORF442; ORF444; ORF449; ORF452; ORF456; 

ORF460; ORF461; ORF465; ORF471; ORF472; ORF482; ORF489; ORF492; ORF494; ORF495; 

ORF502; ORF505; ORF506; ORF509; ORF516; ORF517; ORF520; ORF525; ORF533; ORF539; 

ORF549; ORF554; ORF557; ORF563; ORF570; ORF573; ORF581; ORF590; ORF591; ORF600; 
5 ORF607; ORF612; ORF613; ORF620; ORF626; ORF629; ORF630; ORF639; ORF644; ORF647; 

ORF656; ORF659; ORF661; ORF685; ORF687; ORF699; ORF700; ORF708; ORF716; ORF719; 

ORF725; ORF747; ORF749; ORF756; ORF765; ORF767; ORF794; ORF796; ORF797; ORF799; 

ORF801; ORF807; ORF821; ORF823; ORF826; ORF847; ORF853; ORF861; ORF870; ORF871; 

ORF875; ORF882; ORF888; ORF889; ORF898; ORF902; ORF903; ORF911; ORF916; ORF931; 
10 ORF939; ORF975; ORF976; ORF978; ORF983; ORF986; ORF987; ORF992; ORF993; ORF1000; 

ORF1002; ORF1008; ORF1019; ORF1022; ORF1032; ORF1034; ORF1046; ORF1054; ORF1060; 

ORF1071 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis transmembrane polypeptide or 
15 one of its representative fragments, having at least 7 transmembrane domains and in that they 

comprise a nucleotide sequence chosen from the following sequences: 

ORF4; ORF6; ORF13; ORF20; ORF51; ORF71; ORF88; ORF118; ORF128; ORF132; ORF133; 

ORF158; ORF159; ORF174; ORF180; ORF189; ORF210; ORF211; ORF214; ORF215; ORF226; 

ORF229; ORF233; ORF235; ORF240; ORF246; ORF251; ORF255; ORF273; ORF354; ORF364; 
20 ORF369; ORF371; ORF397; ORF401; ORF409; ORF412; ORF419; ORF439; ORF453; ORF462; 

ORF490; ORF510; ORF511; ORF518; ORF535; ORF548; ORF550; ORF564; ORF565; ORF578; 

ORF579; ORF614; ORF631; ORF636; ORF650; ORF662; ORF667; ORF679; ORF681; ORF702; 

ORF727; ORF741; ORF763; ORF791; ORF792; ORF815; ORF816; ORF832; ORF846; ORF858; 

ORF865; ORF867; ORF868; ORF877; ORF891; ORF896; ORF907; ORF908; ORF918; ORF919; 
25 ORF932; ORF959; ORF977; ORF994; ORF998; ORF1024; ORF1028; ORF1042; ORF1067; 

ORF1070; ORF1073 and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis surface exposed polypeptide 

(e^, an outer membrane protein) or one of its representative fragments, said nucleotide sequences 
30 comprising a nucleotide sequence chosen from the following sequences: 

ORF 2, ORF 3, ORF 21, ORF 22, ORF 23, ORF 53, ORF 77, ORF 187, ORF 203, ORF 383, ORF 

477, ORF 478, ORF 479, ORF 481, ORF 482, ORF 483, ORF 484, ORF 485, ORF 486, ORF 487, 

ORF 488, ORF 489, ORF 490, ORF 571, ORF 572, ORF 573, ORF 593, ORF 670, ORF 693, ORF 

742, ORF 749, ORF 801, ORF 817, ORF 818, ORF 819, ORF 820, ORF 851, ORF 902, ORF 923, 
35 ORF 1035, ORF 1036, ORF 1037, ORF 1038, ORF 1069, ORF 1070, ORF 1071, ORF 1073, ORF 
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1076, ORF 1095, ORF 1096, ORF 1 141, ORF 1181, and their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis lipoprotein or one of its 
representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from 
5 the following sequences: 

ORF 29, ORF 42, ORF 66, ORF 72, ORF 76, ORF 78, ORF 148, ORF 154, ORF 180, ORF 182, ORF 
184, ORF 187, ORF 200, ORF 242, ORF 245, ORF 250, ORF 253, ORF 272, ORF 274, ORF 275, 
ORF 308, ORF 350, ORF 362, ORF 383, ORF 394, ORF 396, ORF 399, 

ORF 422, ORF 488, ORF 535, ORF 568, ORF 573, ORF 578, ORF 593, ORF 607, ORF 625, ORF 
10 662, ORF 669, ORF 688, ORF 690, ORF 716, ORF 773, ORF 778, ORF 781, ORF 783, ORF 788, 

ORF 817, ORF 848, ORF 851, ORF 853, ORF 857, ORF 875, ORF 877, ORF 886, ORF 898, ORF 

902, ORF 923, ORF 938, ORF 976, ORF 978, ORF 990, ORF 1005, ORF 1021, ORF 1035, ORF 

1069, ORF 1083, ORF 1088, ORF 1089, ORF 1091, ORF 1092, ORF 1095, ORF 1096, ORF 1100, 

ORF 1105, ORF 1108, ORF 1117, ORF 1120, ORF 1121, ORF 1124, ORF 1128, ORF 1133, ORF 
15 1135, ORF 1139, ORF 1140, ORF 1157, ORF 1159, ORF 1163, ORF 1165, ORF 1167, ORF 1168, 

ORF 1169, ORF 1171, ORF 1173, ORF 1174, ORF 1177, ORF 1180, ORF 1181, ORF 1186, ORF 

1 194, ORF 1 197, and their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide involved in 
20 lipopolysaccharide (LPS) biosynthesis, said nucleotide sequences comprising a nucleotide sequence 

chosen from the following sequences: ORF 17, ORF 201, ORF 691, ORF 807, ORF 936, ORF 983, 

ORF 1019, ORF 1077 and one of their representative fragments. 

Preferably the invention relates to additional LPS-related nucleotide sequences 

according to the invention, characterized in that they encode: 
25 (a) a Chlamydia trachomatis KDO (3-deoxy-D-manno-octulosonic acid)-related 

polypeptide or one of its representative fragments, said nucleotide sequences comprising a nucleotide 

sequence chosen from the following sequences: ORF 41, ORF 242, ORF 269, ORF 772, and one of 

their representative fragments; 

(b) a Chlamydia trachomatis phosphomannomutase-related polypeptide or one of 
30 its representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from 

the following sequence: ORF 139, and one of its representative fragments; 

(c) a Chlamydia trachomatis phosphoglucomutase-related polypeptide or one of its 
representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from 
the following sequence: ORF 567, and one of its representative fragments; and 

35 (d) a Chlamydia trachomatis lipid A component-related polypeptide or one of its 



WO 99/28475 PCT/IB98/01939 
27 

representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from 
the following sequences: ORF 4, ORF 933, ORF 934, ORF 935, ORF 1185, and one of their 
representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
5 invention, characterized in that they encode a Chlamydia trachomatis Type m or other, non-Type HI 
secreted polypeptides or one of its representative fragments, said nucleotide sequences comprising a 
nucleotide sequence chosen from the following sequences: ORF 180, ORF 181, ORF 207, ORF 208, 
ORF 372, ORF 391, ORF 399, ORF 477, ORF 486, ORF 749, ORF 758, ORF 819, ORF 878, ORF 
888, ORF 896, ORF 897, ORF 900, ORF 902, ORF 923, ORF 1015, ORF 1018, ORF 1059, ORF 
10 1060, ORF 1069, ORF 1071, ORF 1073, ORF 1076, ORF 1 189, and their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide containing RGD 
(Arg-Gly-Asp) attachment sites or one of its representative fragments: 

15 ( a ) RGD-containing proteins that are outer membrane proteins, are more likely to 

play a role in cell attachment. ORFs that encoded a protein containing an RGD sequence and also 
were classified as outer membrane proteins are ORF 488, ORF 489, ORF 571, ORF 572, ORF 573 or 
ORF 716, and its representative fragments. 

20 (b) The outer membrane of Chlamydia is made of cysteine-rich proteins that form 

a network of both intra and inter molecular disulfide links. This contributes to the integrity of the 
membrane since Chlamydia lacks the peptidoglycan layer that other gram-negative bacteria have. 
Cysteine-rich proteins that have the RGD sequence are also considered to be potential vaccine 
candidates. Cysteine-rich proteins were defined as proteins that had more than 3.0% cysteine in their 

25 primary amino acid sequence, above the mean genomic ORF cysteine content. The corresponding 
ORF is: ORF 1144 and one of its representative fragments. 

(c) The outer membrane of Chlamydia may also contain small proteins that have 
cysteines in their N- and C-terminus that may contribute to the network formed by disulfide linkages. 

30 These proteins may be anchored in the outer membrane via their N-terminus and may have their C- 
terminus exposed, which then can interact with the host cells. Alternatively, these proteins may be 
anchored in the outer membrane via both N-and C-terminus and may have regions in the middle that 
may be exposed which can in turn interact with the host cells. ORFs encoding polypeptides that 
contain cysteines in their first 30 amino acids and also contain an RGD sequence are: ORF 101, ORF 

35 122, ORF 308, ORF 488, ORF 489, ORF 571, ORF 572, ORF 573, ORF 651, ORF 679, ORF 680, 
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ORF 705, ORF 716, ORF 763, ORF 870, ORF 878, ORF 879, ORF 995, ORF 1028, ORF 1029, ORF 
1176, and one of their representative fragments. 

(d) RGD-containing ORFs homologous to RGD-containing ORFs from 
5 Chlamydia pneumoniae are: 

ORF 28, ORF 101, ORF 125, ORF 155, ORF 156, ORF 286, ORF 571, ORF 572, ORF 573, ORF 
763, ORF 870, and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 

10 invention, characterized in that they encode a Chlamydia trachomatis cell wall anchored surface 
polypeptide or one of its representative fragments, said nucleotide sequences comprising a nucleotide 
sequence chosen from the following sequences: ORF 662, ORF 681, ORF 1 182, ORF 1 192, and their 
representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 

15 invention, characterized in that they encode Chlamydia trachomatis polypeptides not found in 
Chlamydia pneumoniae (Blastp P>e"'°), said nucleotide sequences comprising a nucleotide sequence 
chosen from the following sequences: ORF 2, ORF 18, ORF 60, ORF 66, ORF 67, ORF 68, ORF 69, 
ORF 70, ORF 81, ORF 89, ORF 107, ORF 108, ORF 109, ORF 134, ORF 147, ORF 191, ORF 194, 
ORF 216, ORF 217, ORF 218, ORF 219, ORF 220, ORF 221, ORF 222, ORF 223, ORF 224, ORF 

20 225, ORF 228, ORF 235, ORF 257, ORF 276, ORF 277, ORF 278, ORF 279, ORF 280, ORF 281, 
ORF 282, ORF 283, ORF 284, ORF 285, ORF 289, ORF 291, ORF 298, ORF 313, ORF 314, ORF 
315, ORF 316, ORF 334, ORF 335, ORF 336, ORF 337, ORF 338, ORF 339, ORF 340, ORF 381, 
ORF 393, ORF 413, ORF 418, ORF 419, ORF 420, ORF 421, ORF 422, ORF 423, ORF 436, ORF 
460, ORF 475, ORF 476, ORF 480, ORF 485, ORF 487, ORF 491, ORF 492, ORF 493, ORF 494, 

25 ORF 496, ORF 500, ORF 504, ORF 514, ORF 527, ORF 559, ORF 569, ORF 570, ORF 575, ORF 
580, ORF 582, ORF 593, ORF 598, ORF 632, ORF 640, ORF 651, ORF 671, ORF 690, ORF 694, 
ORF 698, ORF 710, ORF 722, ORF 723, ORF 724, ORF 770, ORF 771, ORF 782, ORF 783, ORF 
784, ORF 790, ORF 795, ORF 798, ORF 805, ORF 810, ORF 817, ORF 829, ORF 830, ORF 864, 
ORF 866, ORF 876, ORF 887, ORF 892, ORF 899, ORF 913, ORF 921, ORF 933, ORF 938, ORF 

30 949, ORF 956, ORF 1010, ORF 1017, ORF 1018, ORF 1027, ORF 1030, ORF 1037, ORF 1038, ORF 
1047, ORF 1072, ORF 1074, ORF 1075, ORF 1078, ORF 1079, ORF 1081, ORF 1083, ORF 1084, 
ORF 1087, ORF 1088, ORF 1089, ORF 1091, ORF 1092, ORF 1094, ORF 1095, ORF 1096, ORF 
1098, ORF 1104, ORF 1105, ORF 1106, ORF 1108, ORF 1110, ORF 1114, ORF 1115, ORF 1116, 
ORF 1117, ORF 1119, ORF 1128, ORF 1132, ORF 1133, ORF 1135, ORF 1136, ORF 1139, ORF 

35 1140, ORF 1141, ORF 1142, ORF 1144, ORF 1148, ORF 1151, ORF 1155, ORF 1157, ORF 1159, 
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ORF 1161, ORF 1162, ORF 1165, ORF 1166, ORF 1167, ORF 1168, ORF 1169, ORF 1171, ORF 
1172, ORF 1173, ORF 1174, ORF 1175, ORF 1176, ORF 1177, ORF 1178, ORF 1180, ORF 1181, 
ORF 1183, ORF 1184, ORF 1186, ORF 1187, ORF 1188, ORF 1192, ORF 1194, ORF 1197, and their 
representative fragments. 

5 Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the intermediate metabolism, in particular in the 

metabolism of sugars and/or of cofactors, such as for example those phosphate isomerase or pyruvate 

kinase, and in that they comprise a nucleotide sequence chosen from the following sequences: 
10 ORF10; ORF44; ORF45; ORF46; ORF47; ORF93; ORF101; ORF102; ORF103; ORF106; ORF107; 

ORF120; ORF121; ORF130; ORF135; ORF140; ORF143; ORF144; ORF145; ORF158; ORF159; 

ORF160; ORF161; ORF192; ORF193; ORF196; ORF197; ORF198; ORF199; ORF227; ORF229; 

ORF236; ORF236; ORF239; ORF243; ORF245; ORF264; ORF265; ORF297; ORF331; ORF333; 

ORF359; ORF360; ORF374; ORF404; ORF405; ORF405; ORF410; ORF415; ORF415; ORF416; 
15 ORF417; ORF432; ORF460; ORF461; ORF462; ORF495; ORF513; ORF515; ORF566; ORF566; 

ORF566; ORF589; ORF613; ORF645; ORF646; ORF647; ORF652; ORF653; ORF654; ORF672; 

ORF673; ORF674; ORF682; ORF684; ORF692; ORF700; ORF725; ORF801; ORF802; ORF835; 

ORF836; ORF837; ORF860; ORF861; ORF862; ORF863; ORF869; ORF869; ORF925; ORF964; 

ORF983 and one of their representative fragments. 
20 Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the intermediate metabolism of nucleotides or nucleic 

acids, such as for example CTP synthetase or GMP synthetase, and in that they comprise a nucleotide 

sequence chosen from the following sequences: 
25 ORF142; ORF142; ORF169; ORF256; ORF268; ORF325; ORF352; ORF366; ORF435; ORF444; 

ORF528; ORF529; ORF530; ORF548; ORF549; ORF601; ORF602; ORF617; ORF619; ORF644; 

ORF745; ORF971; ORF972; ORF1023 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
30 representative fragments which is involved in the metabolism of nucleic acids, such as for example 

DNA polymerases or DNA topoisomerases, and in that they comprise a nucleotide sequence chosen 

from the following sequences: 

ORF5; ORF12; ORF82; ORF96; ORF97; ORF98; ORF99; ORF100; ORF105; ORF118; ORF136; 
ORF137; ORF163; ORF190; ORF204; ORF259; ORF260; ORF262; ORF290; ORF300; ORF301; 
35 ORF302; ORF387; ORF427; ORF434; ORF441; ORF444; ORF471; ORF595; ORF596; ORF597; 
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ORF599; ORF600; ORF605; ORF612; ORF624; ORF625; ORF650; ORF657; ORF658; ORF702; 

ORF703; ORF704; ORF708; ORF719; ORF766; ORF767; ORF775; ORF779; ORF787; ORF788; 

ORF794; ORF841; ORF842; ORF883; ORF884; ORF907; ORF918; ORF924; ORF92S; ORF929; 

ORF962; ORF962; ORF963; ORF969; ORF970; ORF975; ORF979; ORF995; ORF1031; ORF1032 
5 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the metabolism of amino acids or polypeptides, such as 

for example serine hydroxymethyl transferase or the proteins which load amino acids onto transfer 
10 RNAs, and in that they comprise a nucleotide sequence chosen from the following sequences: 

ORF27; ORF41; ORF55; ORF56; ORF57; ORF59; ORF62; ORF63; ORF64; ORF65; ORF119; 

ORF132; ORF240; ORF241; ORF277; ORF278; ORF279; ORF382; ORF406; ORF428; ORF442; 

ORF446; ORF447; ORF453; ORF454; ORF541; ORF542; ORF591; ORF608; ORF609; ORF610; 

ORF618; ORF648; ORF649; ORF660; ORF661; ORF677; ORF717; ORF765; ORF797; ORF871; 
15 ORF875; ORF920; ORF922; ORF937; ORF998; ORF1020; ORF1021; ORF1034; ORF1044; 

ORF1046; ORF1049 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the metabolism of polypeptides, such as for example 
20 protein kinases or proteases, and in that they comprise a nucleotide sequence chosen from the 

following sequences: 

ORF21; ORF22; ORF23; ORF24; ORF25; ORF26; ORF75; ORF84; ORF86; ORF92; ORF133; 

ORF151; ORF152; ORF157; ORF179; ORF209; ORF307; ORF326; ORF343; ORF344; ORF345; 

ORF371; ORF429; ORF519; ORF557; ORF586; ORF587; ORF630; ORF656; ORF706; ORF707; 
25 ORF730; ORF751; ORF752; ORF786; ORF847; ORF885; ORF923; ORF978; ORF1039; ORF1048 

and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the metabolism of fatty acids, such as for example 
30 succinyl-CoA-synthesizing proteins or phosphatidylserine synthetase, and in that they comprise a 

nucleotide sequence chosen from the following sequences: 

ORF4; ORF15; ORF16; ORF141; ORF173; ORF205; ORF205; ORF206; ORF207; ORF208; 
ORF312; ORF355; ORF415; ORF550; ORF558; ORF560; ORF561; ORF574; ORF574; ORF577; 
ORF578; ORF590; ORF614; ORF772; ORF808; ORF809; ORF904; ORF905; ORF905; ORF933; 
35 ORF934; ORF934; ORF936 and one of their representative fragments. 
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Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
representative fragments which is involved in the synthesis of the wall, such as for example 
KDO transferase, and the proteins responsible for the attachment of certain sugars onto the exposed 
5 proteins, and in that they comprise a nucleotide sequence chosen from the following sequences: 

ORF87; ORF196; ORF242; ORF269; ORF628; ORF629; ORF634; ORF635; ORF637; ORF638; 
ORF1019 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
10 representative fragments which is involved in the transcription, translation and/or maturation process, 
such as for example initiation factors, RNA polymerases or certain chaperone proteins, and in that 
they comprise a nucleotide sequence chosen from the following sequences: 

ORF112; ORF113; ORF332; ORF212; ORF213; ORF350; ORF362; ORF363; ORF364; ORF407; 

ORF451; ORF546; ORF643; ORF744; ORF746; ORF833; ORF868; ORF981; ORF982; ORF1003; 
15 ORF101 1; ORF1042 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis ribosomal polypeptide or one of 

its representative fragments, such as for example the ribosomal proteins L21, L27 and S 10, and in that 

they comprise a nucleotide sequence chosen from the following sequences: 
20 ORF114; ORF115; ORF116; ORF328; ORF361; ORF375; ORF445; ORF543; ORF584; ORF585; 

ORF743; ORF813; ORF941; ORF942; ORF944; ORF946; ORF947; ORF948; ORF950; ORF951; 

ORF952; ORF953; ORF954; ORF955; ORF955; ORF957; ORF958; ORF960; ORF961; ORF1040; 

ORF1041; ORF1043; ORF1063; ORF1064 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
25 invention, characterized in that they encode a Chlamydia trachomatis transport polypeptide or one of 

its representative fragments, such as for example the proteins for transporting amino acids, sugars and 

certain oligopeptides, and in that they comprise a nucleotide sequence chosen from the following 

sequences: 

ORF6; ORF50; ORF51; ORF80; ORF125; ORF126; ORF128; ORF129; ORF215; ORF246; ORF248; 
30 ORF249; ORF251; ORF252; ORF253; ORF255; ORF271; ORF275; ORF293; ORF309; ORF323; 

ORF324; ORF398; ORF401; ORF449; ORF511; ORF512; ORF564; ORF565; ORF667; ORF679; 

ORF680; ORF711; ORF712; ORF713; ORF714; ORF715; ORF730; ORF731; ORF736; ORF737; 

ORF738; ORF870; ORF908; ORF919; ORF977; ORF987; ORF988; ORF992; ORF993; ORF994; 

ORF1028; ORF1029 and one of their representative fragments. 
35 Preferably, the invention also relates to the nucleotide sequences according to the 
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invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
representative fragments which is involved in the virulence process, such as for example the proteins 
analogous to the Escherichia coli vacB protein, and in that they comprise a nucleotide sequence 
chosen from the following sequences: 
5 ORF20; ORF815; ORF816; ORF898; ORF1059; ORF1060 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
representative fragments which is involved in the secretory system and/or which is secreted, such as 
for example proteins homologous to proteins in the secretory system of certain bacteria such as the 
10 Salmonellae or the Yersiniae, and in that they comprise a nucleotide sequence chosen from the 
following sequences: 

ORF758; ORF888; ORF889; ORF890; ORF891; ORF896; ORF897; ORF898 and one of their 
representative fragments. 

Preferably, the invention also relates to nucleotide sequences according to the 
15 invention, characterized in that they encode a polypeptide specific to Chlamydiae or one of its 
representative fragments, and in that they comprise a nucleotide sequence chosen from the following 
sequences: 

ORF22; ORF29; ORF31; ORF32; ORF34; ORF35; ORF39; ORF40; ORF43; ORF48; ORF49; 

ORF50; ORF52; ORF53; ORF54; ORF72; ORF77; ORF78; ORF87; ORF90; ORF95; ORF108; 
20 ORF110; ORF111; ORF122; ORF123; ORF124; ORF127; ORF138; ORF144; ORF146; ORF153; 

ORF155; ORF164; ORF166; ORF175; ORF182; ORF184; ORF186; ORF187; ORF188; ORF202; 

ORF210; ORF247; ORF258; ORF266; ORF267; ORF270; ORF273; ORF274; ORF295; ORF296; 

ORF305; ORF306; ORF309; ORF318; ORF319; ORF322; ORF326; ORF342; ORF357; ORF376; 

ORF379; ORF380; ORF388; ORF390; ORF400; ORF431; ORF433; ORF438; ORF443; ORF456; 
25 ORF457; ORF458; ORF464; ORF468; ORF470; ORF473; ORF486; ORF489; ORF497; ORF501; 

ORF503; ORF504; ORF508; ORF512; ORF521; ORF522; ORF523; ORF524; ORF533; ORF535; 

ORF536; ORF537; ORF538; ORF539; ORF540; ORF554; ORF563; ORF572; ORF579; ORF595; 

ORF603; ORF604; ORF606; ORF607; ORF615; ORF616; ORF622; ORF641; ORF642; ORF659; 

ORF668; ORF670; ORF693; ORF695; ORF696; ORF699; ORF703; ORF704; ORF716; ORF726; 
30 ORF728; ORF739; ORF742; ORF747; ORF750; ORF751; ORF755; ORF757; ORF759; ORF761; 

ORF762; ORF763; ORF764; ORF773; ORF780; ORF781; ORF789; ORF800; ORF803; ORF804; 

ORF818; ORF820; ORF822; ORF823; ORF824; ORF827; ORF828; ORF839; ORF849; ORF850; 

ORF851; ORF852; ORF855; ORF856; ORF857; ORF858; ORF859; ORF860; ORF861; ORF862; 

ORF863; ORF865; ORF868; ORF869; ORF870; ORF871; ORF872; ORF873; ORF874; ORF875; 
35 ORF877; ORF878; ORF880; ORF882; ORF884; ORF886; ORF893; ORF901; ORF906; ORF910; 
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ORF912; ORF915; ORF916; ORF917; ORF926; ORF929; ORF933; ORF965; ORF967; ORF968; 
ORF984; ORF986; ORF989; ORF990; ORF996; ORF997; ORF1001; ORF1002; ORF1013; 
ORF1016; ORF1031; ORF1033; ORF1035; ORF1049; ORF1051; ORF1052; ORF1054; ORF1056; 
ORF1057; ORF1058; ORF1062; ORF1070; ORF1071; ORF1073 and one of their representative 
5 fragments. 

Also forming part of the invention are polypeptides encoded by the polynucleotides 
of the invention, as well as fusion polypeptides comprising such polypeptides. In one embodiment, 
the polypeptides and fusion polypeptides immunoreact with seropositive serum of an individual 
infected with Chlamydia trachomatis. For example, described below, are polypeptide sequences 
10 exhibiting particularly preferable characteristics. For each group of preferred polypeptides described 
below, it is to be understood that in addition to the individual polypeptides listed, in instances wherein 
such polypeptides are encoded as part of «combined» ORFs, such «combined» polypeptides are also 
to be included within the preferred group. 

The subject of the invention is also a polypeptide according to the invention, 
15 characterized in that it is a polypeptide of the cellular envelope, preferably of the outer cellular 
envelope, of Chlamydia trachomatis or one of its representative fragments. According to the 
invention, the said polypeptide is preferably chosen from the polypeptides having the following 
sequences: 

SEQ ID No. 3; SEQ ID No. 19; SEQ ID No. 51 ; SEQ ID No. 189; SEQ ID No. 212; SEQ ID No. 213; 
20 SEQ ID No. 324; SEQ ID No. 477; SEQ ID No. 478; SEQ ID No. 479; SEQ ID No. 481; 

SEQ ID No. 482; SEQ ID No. 483; SEQ ID No. 484; SEQ ID No. 486; SEQ ID No. 488; 

SEQ ID No. 489; SEQ ID No. 490; SEQ ID No. 572; SEQ ID No. 573; SEQ ID No. 742; 

SEQ ID No. 817; SEQ ID No. 818; SEQ ID No. 820; SEQ ID No. 1035; SEQ ID No. 1036; 

SEQ ID No. 1037; SEQ ID No. 1038; SEQ ID No. 1070; SEQ ID No. 1071; SEQ ID No. 1073 and 
25 one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia trachomatis transmembrane polypeptide or one of its 

representative fragments, having between 1 and 3 transmembrane domains, and in that it is chosen 

from the polypeptides having the following sequences: 
30 SEQ ID No. 2; SEQ ID No. 3; SEQ ID No. 5; SEQ ID No. 8; SEQ ID No. 9; SEQ ID No. 10; 

SEQ ID No. 11; SEQ ID No. 12; SEQ ID No. 17; SEQ ID No. 21; SEQ ID No. 26; SEQ ID No. 27; 

SEQ ID No. 28; SEQ ID No. 29; SEQ ID No. 30; SEQ ID No. 31; SEQ ID No. 33; SEQ ID No. 35; 

SEQ ID No. 37; SEQ ID No. 39; SEQ ID No. 40; SEQ ID No. 41 ; SEQ ID No. 42; SEQ ID No. 43; 

SEQ ID No. 44; SEQ ID No. 45; SEQ ID No. 46; SEQ ID No. 47; SEQ ID No. 48; SEQ ID No. 49; 
35 SEQ ID No. 52; SEQ ID No. 53; SEQ ID No. 55; SEQ ID No. 56; SEQ ID No. 58; SEQ ID No. 65; 
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SEQIDNo.66; SEQ ID No. 68; SEQIDNo.70; SEQIDNo.74; SEQIDNo.75; SEQ ID No. 76 
SEQIDNo.78; SEQ ID No. 79; SEQ ID No. 81; SEQ ID No. 82; SEQ ID No. 83; SEQ ID No. 86 
SEQ ID No. 91; SEQ ID No. 92; SEQ ID No. 94; SEQ ID No. 97; SEQ ID No. 100; SEQ ID No. 102 



SEQ ID No. 103; 
5 SEQ ID No. 110; 
SEQ ID No. 115; 
SEQ ID No. 123; 
SEQ ID No. 140; 
SEQ ID No. 147; 
10 SEQ ID No. 155; 
SEQ ID No. 165; 
SEQ ID No. 170; 
SEQ ID No. 177; 
SEQ ID No. 187; 
15 SEQ ID No. 194; 
SEQ ID No. 199; 
SEQ ID No. 207; 
SEQ ID No. 219; 
SEQ ID No. 224; 
20 SEQ ID No. 232; 
SEQ ID No. 244; 
SEQ ID No. 252; 
SEQ ID No. 263; 
SEQ ID No. 271; 
25 SEQ ID No. 278; 
SEQ ID No. 285; 
SEQ ID No. 294; 
SEQ ID No. 311; 
SEQ ID No. 320; 
SEQ ID No. 327; 
SEQ ID No. 333; 
SEQ ID No. 339; 
SEQ ID No. 348; 
SEQ ID No. 353; 
35 SEQ ID No. 362; 
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SEQ ID No. 105; 
SEQ ID No. Ill; 
SEQ ID No. 116; 
SEQ ID No. 130; 
SEQ ID No. 141; 
SEQ ID No. 148; 
SEQ ID No. 156; 
SEQ ID No. 166; 
SEQ ID No. 171; 
SEQ ID No. 181; 
SEQ ID No. 188; 
SEQ ID No. 195; 
SEQ ID No. 201; 
SEQ ID No. 209; 
SEQ ID No. 220; 
SEQ ID No. 225; 
SEQ ID No. 234; 
SEQ ID No. 245; 
SEQ ID No. 254; 
SEQ ID No. 265; 
SEQ ID No. 272; 
SEQ ID No. 279; 
SEQ ID No. 287; 
SEQ ID No. 298; 
SEQ ID No. 313; 
SEQ ID No. 322; 
SEQ ID No. 328; 
SEQ ID No. 334; 
SEQ ID No. 340; 
SEQ ID No. 349; 
SEQ ID No. 356; 
SEQ ID No. 366; 



SEQ ID No. 106 
SEQ ID No. 112 
SEQ ID No. 117 
SEQ ID No. 134 
SEQ ID No. 143 
SEQ ID No. 149 
SEQ ID No. 162 
SEQ ID No. 167 
SEQ ID No. 173 
SEQ ID No. 183: 
SEQ ID No. 190; 
SEQ ID No. 196 
SEQ ID No. 202 
SEQ ID No. 212 
SEQ ID No. 221 
SEQ ID No. 227 
SEQ ID No. 236 
SEQ ID No. 247: 
SEQ ID No. 257 
SEQ ID No. 266 
SEQ ID No. 274; 
SEQ ID No. 282 
SEQ ID No. 289 
SEQ ID No. 305: 
SEQ ED No. 315 
SEQ ID No. 323 
SEQ ID No. 330; 
SEQ ID No. 335 
SEQ ID No. 341 
SEQ ID No. 350; 
SEQ ID No. 357 
SEQ ID No. 367 



SEQ ID No. 107; 
SEQ ID No. 113; 
SEQ ID No. 120; 
SEQ ID No. 135; 
SEQ ID No. 144; 
SEQ ID No. 150; 
SEQ ID No. 163; 
SEQ ID No. 168; 
SEQ ID No. 175; 
SEQ ID No. 184; 
SEQ ID No. 191; 
SEQ ID No. 197; 
SEQ ID No. 204; 
SEQ ID No. 213; 
SEQ ID No. 222; 
SEQ ID No. 228; 
SEQ ID No. 237; 
SEQ ID No. 248; 
SEQ ID No. 260; 
SEQ ID No. 267; 
SEQ ID No. 276; 
SEQ ID No. 283; 
SEQ ID No. 290; 
SEQ ID No. 306; 
SEQ ID No. 316; 
SEQ ID No. 325; 
SEQ ID No. 331; 
SEQ ID No. 336; 
SEQ ID No. 344; 
SEQ ID No. 351; 
SEQ ID No. 358; 
SEQ ID No. 368; 



SEQ ID No. 109 
SEQ ID No. 1 14 
SEQ ID No. 122 
SEQ ID No. 137 
SEQ ID No. 145 
SEQ ID No. 151 
SEQ ID No. 164 
SEQ ID No. 169 
SEQ ID No. 176 
SEQ ID No. 186 
SEQ ID No. 192: 
SEQ ID No. 198 
SEQ ID No. 206 
SEQ ID No. 217 
SEQ ID No. 223 
SEQ ID No. 231 
SEQ ID No. 243 
SEQ ID No. 249 
SEQ ID No. 261 
SEQ ID No. 270: 
SEQ ID No. 277: 
SEQ ID No. 284 
SEQ ID No. 291 
SEQ ID No. 310; 
SEQ ID No. 319 
SEQ ID No. 326 
SEQ ID No. 332: 
SEQ ID No. 338: 
SEQ ID No. 345: 
SEQ ID No. 352: 
SEQ ID No. 361 
SEQ ID No. 370 
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SEQ ID No. 372; 

SEQ ID No. 379; 

SEQ ID No. 385; 

SEQ ID No. 393; 
5 SEQ ID No. 404; 

SEQ ID No. 417; 

SEQ ID No. 426; 

SEQ ID No. 434; 

SEQ ID No. 443; 
10 SEQ ID No. 454; 

SEQ ID No. 463; 

SEQ ID No. 469; 

SEQ ID No. 476; 

SEQ ID No. 483; 
15 SEQ ID No. 488; 

SEQ ID No. 498; 

SEQ ID No. 508; 

SEQ ID No. 521; 

SEQ ID No. 529; 
20 SEQ ID No. 536; 

SEQ ID No. 542; 

SEQ ID No. 547; 

SEQ ID No. 558; 

SEQ ID No. 566; 
25 SEQ ID No. 572; 

SEQ ID No. 582; 

SEQ ID No. 593; 

SEQ ID No. 601; 

SEQ ED No. 609; 
30 SEQ ID No. 617; 

SEQ ID No. 624; 

SEQ ID No. 634; 

SEQ ED No. 641; 
SEQ ID No. 651; 
35 SEQ ID No. 658; 



SEQ ED No. 373; 
SEQ ID No. 380; 
SEQ ED No. 387; 
SEQ ID No. 396; 
SEQ ED No. 406; 
SEQ ED No. 418; 
SEQ ID No. 427; 
SEQ ED No. 435; 
SEQ ED No. 446; 
SEQ ID No. 455; 
SEQ ID No. 464; 
SEQ ED No. 470; 
SEQ ID No. 477; 
SEQ ID No. 484; 
SEQ ED No. 491; 
SEQ ID No. 500; 
SEQ ID No. 512; 
SEQ ED No. 523; 
SEQ ID No. 530; 
SEQ ED No. 537; 
SEQ ID No. 543; 
SEQ ED No. 551; 
SEQ ED No. 559; 
SEQ ID No. 567; 
SEQ ED No. 574; 
SEQ ED No. 585; 
SEQ ED No. 595; 
SEQ ED No. 602; 
SEQ ID No. 610; 
SEQ ID No. 618; 
SEQ ID No. 625; 
SEQ ED No. 635; 
SEQ ED No. 643; 
SEQ ED No. 652; 
SEQ ID No. 664; 
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SEQ ED No. 375; 
SEQ ED No. 382; 
SEQ ED No. 389; 
SEQ ED No. 398; 
SEQ ED No. 407; 
SEQ ID No. 420; 
SEQ ED No. 428; 
SEQ ED No. 436; 
SEQ ED No. 448; 
SEQ ED No. 457; 
SEQ ED No. 466; 
SEQ ID No. 473; 
SEQ ED No. 479; 
SEQ ED No. 485; 
SEQ ED No. 493; 
SEQ ED No. 501; 
SEQ ED No. 513; 
SEQ ED No. 524; 
SEQ ED No. 531; 
SEQ ED No. 538; 
SEQ ED No. 544; 
SEQ ED No. 552; 
SEQ ED No. 560; 
SEQ ID No. 568; 
SEQ ED No. 575; 
SEQ ED No. 587; 
SEQ ED No. 596; 
SEQ ED No. 603; 
SEQ ED No. 611; 
SEQ ED No. 621; 
SEQ ID No. 628; 
SEQ ED No. 637; 
SEQ ED No. 646; 
SEQ ED No. 653; 
SEQ ED No. 665; 



SEQ ED No. 377; 
SEQ ID No. 383; 
SEQ ED No. 390; 
SEQ ID No. 399; 
SEQ ED No. 413; 
SEQ ED No. 421; 
SEQ ED No. 430; 
SEQ ID No. 437; 
SEQ ID No. 450; 
SEQ ID No. 458; 
SEQ ED No. 467; 
SEQ ED No. 474; 
SEQ ED No. 480; 
SEQ ED No. 486; 
SEQ ED No. 496; 
SEQ ID No. 503; 
SEQ ED No. 514; 
SEQ ED No. 526; 
SEQ ED No. 532; 
SEQ ED No. 540; 
SEQ ED No. 545; 
SEQ ED No. 553; 
SEQ ED No. 561; 
SEQ ED No. 569; 
SEQ ED No. 576; 
SEQ ID No. 589; 
SEQ ED No. 597; 
SEQ ED No. 604; 
SEQ ED No. 615; 
SEQ ED No. 622; 
SEQ ID No. 632; 
SEQ ID No. 638; 
SEQ ID No. 648; 
SEQ ED No. 654; 
SEQ ID No. 666; 



SEQ ED No. 378; 
SEQ ID No. 384; 
SEQ ID No. 391; 
SEQ ED No. 403; 
SEQ ED No. 414; 
SEQ ED No. 424; 
SEQ ED No. 433; 
SEQ ID No. 440; 
SEQ ED No. 451; 
SEQ ED No. 459; 
SEQ ED No. 468; 
SEQ ED No. 475; 
SEQ ED No. 481; 
SEQ ED No. 487; 
SEQ ED No. 497; 
SEQ ID No. 504; 
SEQ ED No. 519; 
SEQ ID No. 527; 
SEQ ED No. 534; 
SEQ ED No. 541; 
SEQ ED No. 546; 
SEQ ED No. 555; 
SEQ ED No. 562; 
SEQ ED No. 571; 
SEQ ID No. 580; 
SEQ ED No. 592; 
SEQ ID No. 599; 
SEQ ED No. 608; 
SEQ ED No. 616; 
SEQ ID No. 623; 
SEQ ED No. 633; 
SEQ ED No. 640; 
SEQ ID No. 649; 
SEQ ED No. 655; 
SEQ ID No. 668; 
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SEQ ID No. 669; 

SEQ ID No. 674; 

SEQ ID No. 682; 

SEQ ID No. 689; 
5 SEQ ID No. 695; 

SEQ ID No. 704; 

SEQ ID No. 710; 

SEQ ID No. 715; 

SEQ ID No. 722; 
10 SEQ ID No. 730; 

SEQ ID No. 737; 

SEQ ID No. 743; 

SEQ ID No. 750; 

SEQ ID No. 755; 
15 SEQ ID No. 764; 

SEQ ID No. 772; 

SEQ ID No. 777; 

SEQ ID No. 782; 

SEQ ID No. 789; 
20 SEQ ID No. 802; 

SEQ ID No. 810; 

SEQ ID No. 820; 

SEQ ID No. 828; 

SEQ ID No. 835; 
25 SEQ ID No. 841; 

SEQ ID No. 849; 

SEQ ID No. 855; 

SEQ ID No. 862; 

SEQ ID No. 872; 
30 SEQ ID No. 880; 

SEQ ID No. 886; 

SEQ ID No. 895; 

SEQ ID No. 904; 

SEQ ID No. 914; 
35 SEQ ID No. 923; 



SEQ ID No. 670; 
SEQ ID No. 676; 
SEQ ID No. 683; 
SEQ ID No. 690; 
SEQ ID No. 696; 
SEQ ID No. 705; 
SEQ ID No. 711; 
SEQ ID No. 717; 
SEQ ID No. 724; 
SEQ ID No. 731; 
SEQ ID No. 738; 
SEQ ID No. 744; 
SEQ ID No. 751; 
SEQ ID No. 757; 
SEQ ID No. 766; 
SEQ ID No. 773; 
SEQ ID No. 778; 
SEQ ID No. 783; 
SEQ ID No. 790; 
SEQ ID No. 803; 
SEQ ID No. 811; 
SEQ ID No. 822; 
SEQ ID No. 829; 
SEQ ID No. 837; 
SEQ ID No. 842; 
SEQ ID No. 850; 
SEQ ID No. 856; 
SEQ ID No. 863; 
SEQ ID No. 873; 
SEQ ID No. 881; 
SEQ ID No. 887; 
SEQ ID No. 897; 
SEQ ID No. 906; 
SEQ ID No. 917; 
SEQ ID No. 924; 
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SEQ ID No. 671; 
SEQ ID No. 677; 
SEQ ED No. 684; 
SEQ ID No. 691; 
SEQ ID No. 698; 
SEQ ID No. 706; 
SEQ ID No. 712; 
SEQ ID No. 718; 
SEQ ID No. 726; 
SEQ ID No. 732; 
SEQ ID No. 739; 
SEQ ID No. 745; 
SEQ ID No. 752; 
SEQ ID No. 758; 
SEQ ID No. 768; 
SEQ ID No. 774; 
SEQ ID No. 779; 
SEQ ID No. 786; 
SEQ ID No. 793; 
SEQ ID No. 806; 
SEQ ID No. 813; 
SEQ ID No. 824; 
SEQ ID No. 830; 
SEQ ID No. 838; 
SEQ ID No. 843; 
SEQ ID No. 851; 
SEQ ID No. 857; 
SEQ 3D No. 864; 
SEQ ID No. 874; 
SEQ ID No. 883; 
SEQ ID No. 892; 
SEQ ID No. 899; 
SEQ ID No. 909; 
SEQ ID No. 920; 
SEQ ID No. 925; 



SEQ ID No. 672; 
SEQ ID No. 678; 
SEQ ID No. 686; 
SEQ ID No. 692; 
SEQ ID No. 701; 
SEQ ID No. 707; 
SEQ ID No. 713; 
SEQ ID No. 720; 
SEQ ID No. 728; 
SEQ ID No. 733; 
SEQ ID No. 740; 
SEQ ID No. 746; 
SEQ ID No. 753; 
SEQ ID No. 759; 
SEQ ED No. 769; 
SEQ ID No. 775; 
SEQ ID No. 780; 
SEQ ID No. 787; 
SEQ ED No. 798; 
SEQ ID No. 808; 
SEQ ID No. 814; 
SEQ ID No. 825; 
SEQ ID No. 833; 
SEQ ID No. 839; 
SEQ ID No. 845; 
SEQ ID No. 852; 
SEQ ID No. 859; 
SEQ ID No. 866; 
SEQ ID No. 878; 
SEQ ID No. 884; 
SEQ ID No. 893; 
SEQ ID No. 900; 
SEQ ED No. 910; 
SEQ ID No. 921; 
SEQ ID No. 926; 



SEQ ID No. 673; 
SEQ ID No. 680; 
SEQ ID No. 688; 
SEQ ID No. 693; 
SEQ ID No. 703; 
SEQ ID No. 709; 
SEQ ID No. 714; 
SEQ ID No. 721; 
SEQ ID No. 729; 
SEQ ID No. 734; 
SEQ ID No. 742; 
SEQ ID No. 748; 
SEQ ID No. 754; 
SEQ ID No. 760; 
SEQ ID No. 771; 
SEQ ID No. 776; 
SEQ ID No. 781; 
SEQ ID No. 788; 
SEQ ID No. 800; 
SEQ ID No. 809; 
SEQ ID No. 817; 
SEQ ID No. 827; 
SEQ ID No. 834; 
SEQ ID No. 840; 
SEQ ID No. 848; 
SEQ ID No. 854; 
SEQ ID No. 860; 
SEQ ID No. 869; 
SEQ ID No. 879; 
SEQ ID No. 885; 
SEQ ID No. 894; 
SEQ ID No. 901; 
SEQ ID No. 912; 
SEQ ID No. 922; 
SEQ ID No. 927; 
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SEQ ID No. 930; 
SEQ ID No. 937 
SEQ ID No. 944 
SEQ ID No. 952 
SEQ ID No. 957: 
SEQ ID No. 963: 
SEQ ID No. 970 
SEQ ID No. 980 
SEQ ID No. 989 
SEQ ID No. 999 



SEQ ID No. 933 
SEQ ID No. 940 
SEQ ID No. 945 
SEQ ID No. 953 
SEQ ID No. 958 
SEQ ID No. 964; 
SEQ ID No. 971 
SEQ ID No. 981 
SEQ ID No. 990 
SEQ ID No. 1001 
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SEQ ID No. 934; 
SEQ ID No. 941; 
SEQ ID No. 947; 
SEQ ID No. 954; 
SEQ ID No. 960; 
SEQ ID No. 966; 
SEQ ID No. 973; 
SEQ ID No. 982; 
SEQ ID No. 991; 
SEQ ID No. 1003; 



SEQ ID No. 935 
SEQ ID No. 942 
SEQ ID No. 948 
SEQ ID No. 955 
SEQ ID No. 961 
SEQ ID No. 967; 
SEQ ID No. 974 
SEQ ID No. 984 
SEQ ID No. 995 
SEQ ID No. 1004 



SEQ ID No. 1006; SEQ ID No. 1007; SEQ ID No. 1009; SEQ ID No. 1010; 
SEQ ID No. 1012; SEQ ID No. 1013; SEQ ID No. 1014; SEQ ID No. 1016 



SEQ ID No. 1018 
SEQ ID No. 1027 

SEQ ID No. 1036; SEQ ID No. 1037 



SEQ ID No. 1020; SEQ ID No. 1021 ; SEQ ID No. 1025 
SEQ ID No. 1029; SEQ ID No. 1030; SEQ ID No. 1031 
SEQ ID No. 1038; SEQ ID No. 1039 
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SEQ ID No. 936 
SEQ ID No. 943 
SEQ ID No. 951 
SEQ ID No. 956 
SEQ ID No. 962 
SEQ ID No. 969 
SEQ ID No. 979 
SEQ ID No. 988 
SEQ ID No. 996: 
SEQ ID No. 1005 
SEQ ID No. 1011 
SEQ ID No. 1017 
SEQ ID No. 1026 
SEQ ID No. 1035 
SEQ ID No. 1040 



SEQ ID No. 1044; SEQ ID No. 1045; SEQ ID No. 1047; SEQ ID No. 1048; SEQ ID No. 1050; 
SEQ ID No. 1051; SEQ ID No. 1052; SEQ ID No. 1053; SEQ ID No. 1055; SEQ ID No. 1056; 
SEQ ID No. 1057; SEQ ID No. 1058; SEQ ID No. 1061; SEQ ID No. 1062; 
SEQ ID No. 1064; SEQ ID No. 1065; SEQ ID No. 1066; SEQ ID No. 1068: 
20 SEQ ID No. 1072; SEQ ID No. 1074; SEQ ID No. 1076 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis transmembrane polypeptide or one of its 
representative fragments, having between 4 and 6 transmembrane domains, and in that it is chosen 
from the polypeptides having the following sequences: 
25 SEQ ID No. 7; SEQ ID No. 14; SEQ ID No. 16; SEQ ID No. 32; SEQ ID No. 34; SEQ ID No. 36: 
SEQ ID No. 38; SEQ ID No. 50; SEQ ID No. 57; SEQ ID No. 59; SEQ ID No. 61; SEQ ID No. 62: 
SEQ ID No. 63; SEQ ID No. 64; SEQ ID No. 67; SEQ ID No. 69; SEQ ID No. 72; SEQ ID No. 77 
SEQ ID No. 80; SEQ ID No. 84; SEQ ID No. 87; SEQ ID No. 93; SEQ ID No. 95; SEQ ID No. 99 



SEQ ID No. 


108; 


SEQ ID No. 119 


SEQ ID No 


125 


SEQ ID No 


126; 


SEQ ID No. 


129 


30 SEQ ID No. 


131; 


SEQ ID No. 136 


SEQ ID No 


139 


SEQ ID No 


146; 


SEQ ID No. 


152 


SEQ ID No. 


154; 


SEQ ID No. 160 


SEQ ID No 


161 


SEQ ID No 


172; 


SEQ ID No. 


179 


SEQ ID No. 


182; 


SEQ ID No. 185 


SEQ ID No 


200 


SEQ ID No 


203; 


SEQ ID No. 


205 


SEQ ID No. 


239; 


SEQ ID No. 242 


SEQ ID No 


250 


SEQ ID No. 


253; 


SEQ ID No. 


256 


SEQ ID No. 259; 


SEQ ID No. 262 


SEQ ID No 


268 


SEQ ID No. 


275; 


SEQ ID No. 


281 


35 SEQ ID No. 


286; 


SEQ ID No. 288 


SEQ ID No 


292 


SEQ ID No. 


295; 


SEQ ID No. 


296 
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SEQIDNo.297; SEQIDNo.299; SEQIDNo.300; SEQIDNo. 308; SEQ ID No. 314 

SEQIDNo. 317; SEQIDNo. 318; SEQIDNo. 324; SEQIDNo. 342; SEQIDNo. 343 

SEQIDNo. 355; SEQIDNo. 360; SEQIDNo. 374; SEQ ID No. 376; SEQIDNo. 386 

SEQIDNo. 388; SEQ ID No. 392; SEQ ID No. 394; SEQ ID No. 395; SEQ ID No. 402 

SEQ ID No. 405; SEQ ID No. 411; SEQ ID No. 415; SEQ ID No. 416; SEQ ID No. 422 

SEQ ID No. 423; SEQ ID No. 429; SEQ ID No. 432; SEQ ID No. 441; SEQ ID No. 442; 

SEQ ID No. 444; SEQ ID No. 449; SEQ ID No. 452; SEQ ID No. 456; SEQ ID No. 460 

SEQ ID No. 461; SEQ ID No. 465; SEQ ID No. 471; SEQ ID No. 472; SEQ ID No. 482 

SEQ ID No. 489; SEQ ID No. 492; SEQ ID No. 494; SEQ ID No. 495; SEQ ID No. 502 

SEQ ID No. 505; SEQ ID No. 506; SEQ ID No. 509; SEQ ID No. 516; SEQ ID No. 517 

SEQ ID No. 520; SEQ ID No. 525; SEQ ID No. 533; SEQ ID No. 539; SEQ ID No. 549 

SEQ ID No. 554; SEQ ID No. 557; SEQ ID No. 563; SEQ ID No. 570; SEQ ID No. 573 

SEQ ID No. 581; SEQ ID No. 590; SEQ ID No. 591; SEQ ID No. 600; SEQ ID No. 607 

SEQ ID No. 612; SEQ ID No. 613; SEQ ID No. 620; SEQ ID No. 626; SEQ ID No. 629 

SEQ ID No. 630; SEQ ID No. 639; SEQ ID No. 644; SEQ ID No. 647; SEQ ID No. 656 

SEQ ID No. 659; SEQ ID No. 661; SEQ ID No. 685; SEQ ID No. 687; SEQ ID No. 699 

SEQ ID No. 700; SEQ ID No. 708; SEQ ID No. 716; SEQ ID No. 719; SEQ ID No. 725 

SEQ ID No. 747; SEQ ID No. 749; SEQ ID No. 756; SEQ ID No. 765; SEQ ID No. 767 

SEQ ID No. 794; SEQ ID No. 796; SEQ ID No. 797; SEQ ID No. 799; SEQIDNo. 801 

SEQ ID No. 807; SEQ ID No. 821; SEQ ID No. 823; SEQ ID No. 826; SEQ ID No. 847: 

SEQIDNo. 853; SEQIDNo. 861; SEQ ID No. 870; SEQIDNo. 871; SEQIDNo. 875 

SEQ ID No. 882; SEQ ID No. 888; SEQ ID No. 889; SEQIDNo. 898; SEQ ID No. 902 

SEQ ID No. 903; SEQ ID No. 911; SEQ ID No. 916; SEQ ID No. 931; SEQ ID No. 939 

SEQ ID No. 975; SEQ ID No. 976; SEQ ID No. 978; SEQ ID No. 983; SEQ ID No. 986: 

SEQ ID No. 987; SEQ ID No. 992; SEQ ID No. 993; SEQ ID No. 1000; SEQIDNo. 1002 

SEQ ID No. 1008; SEQ ID No. 1019; SEQ ID No. 1022; SEQ ID No. 1032; SEQ ID No. 1034 

SEQ ID No. 1046; SEQ ID No. 1054; SEQ ID No. 1060; SEQIDNo. 1071 and one of their 
representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis transmembrane polypeptide or one of its 
representative fragments, having at least 7 transmembrane domains, and in that it is chosen from the 
polypeptides having the following sequences: SEQ ID No. 4; SEQ ID No. 6; SEQ ID No. 13; 
SEQ ID No. 20; SEQ ID No. 51; SEQ ID No. 71; SEQ ID No. 88; SEQ ID No. 118; SEQ ID No. 128 

SEQIDNo. 132; SEQIDNo. 133; SEQIDNo. 158; SEQIDNo. 159; SEQIDNo. 174 

SEQIDNo. 180; SEQIDNo. 189; SEQ ID No. 210; SEQIDNo. 211; SEQ ID No. 214 
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SEQIDNo.215; SEQIDNo.226; SEQIDNo.229; SEQIDNo.233; SEQIDNo.235 
SEQIDNo.240; SEQIDNo.246; SEQEDNo.251; SEQIDNo.255; SEQ ID No. 273 
SEQIDNo.354; SEQ ID No. 364; SEQ ID No. 369; SEQ ID No. 371; SEQ ID No. 397 
SEQ ID No. 401; SEQ ID No. 409; SEQ ID No. 412; SEQ ID No. 419; SEQ ID No. 439: 
SEQ ID No. 453; SEQ ID No. 462; SEQ ID No. 490; SEQ ID No. 510; SEQ ID No. 511 
SEQ ID No. 518; SEQ ID No. 535; SEQ ID No. 548; SEQ ID No. 550; SEQ ID No. 564 
SEQ ID No. 565; SEQ ID No. 578; SEQ ID No. 579; SEQ ID No. 614; SEQ ID No. 631 
SEQ ID No. 636; SEQ ID No. 650; SEQ ID No. 662; SEQ ID No. 667; SEQ ID No. 679 
SEQ ID No. 681; SEQ ID No. 702; SEQ ID No. 727; SEQ ID No. 741; SEQ ID No. 763 
SEQ ID No. 791; SEQ ID No. 792; SEQ ID No. 815; SEQ ID No. 816; SEQ ID No. 832 
SEQ ID No. 846; SEQ ID No. 858; SEQ ID No. 865; SEQ ID No. 867; SEQ ID No. 868 
SEQ ID No. 877; SEQ ID No. 891; SEQ ID No. 896; SEQ ID No. 907; SEQ ID No. 908; 
SEQ ID No. 918; SEQ ID No. 919; SEQ ID No. 932; SEQ ID No. 959; SEQ ID No. 977 
SEQ ID No. 994; SEQ ID No. 998; SEQ ID No. 1024; SEQ ID No. 1028; SEQ ID No. 1042 
SEQ ID No. 1067; SEQ ID No. 1070; SEQ ID No. 1073 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis surface exposed polypeptide or one of its representative fragments, and in 
that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 21, SEQ ID No. 22, SEQ ID No. 23, SEQ ID No. 53, SEQ 
ID No. 77, SEQ ID No. 187, SEQ ID No. 203, SEQ ID No. 383, SEQ ID No. 477, SEQ ID No. 478, 
SEQ ID No. 479, SEQ ID No. 481, SEQ ID No. 482, SEQ ID No. 483, SEQ ID No. 484, SEQ ID No. 
485, SEQ ID No. 486, SEQ ID No. 487, SEQ ID No. 488, SEQ ID No. 489, SEQ ID No. 490, SEQ ID 
No. 571, SEQ ID No. 572, SEQ ID No. 573, SEQ ID No. 593, SEQ ID No. 670, SEQ ID No. 693, 
SEQ ID No. 742, SEQ ID No. 749, SEQ ID No. 801, SEQ ID No. 817, SEQ ID No. 818, SEQ ID No. 
819, SEQ ID No. 820, SEQ ID No. 851, SEQ ID No. 902, SEQ ID No. 923, SEQ ID No. 1035, SEQ 
ID No. 1036, SEQ ID No. 1037, SEQ ID No. 1038, SEQ ID No. 1069, SEQ ID No. 1070, SEQ ID No. 
1071, SEQ ID No. 1073, SEQ ID No. 1076, SEQ ID No. 1095, SEQ ID No. 1096, SEQ ID No. 1141, 
SEQ ID No. 1 181, and their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis lipoprotein or one of its representative fragments, 
and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 29, SEQ ID No. 42, SEQ ID No. 66, SEQ ID No. 72, SEQ ID No. 76, SEQ ID No. 78, 
SEQ ID No. 148, SEQ ID No. 154, SEQ ID No. 180, SEQ ID No. 182, SEQ ID No. 184, SEQ ID No. 
187, SEQ ID No. 200, SEQ ID No. 242, SEQ ID No. 245, SEQ ID No. 250, SEQ ID No. 253, SEQ ID 
No. 272, SEQ ID No. 274, SEQ ID No. 275, SEQ ID No. 308, SEQ ID No. 350, SEQ ID No. 362, 
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SEQ ID No. 383, SEQ ID No. 394, SEQ ID No. 396, SEQ ID No. 399, SEQ ID No. 422, SEQ ID No. 
488, SEQ ID No. 535, SEQ ID No. 568, SEQ ID No. 573, SEQ ID No. 578, SEQ ID No. 593, SEQ ID 
No. 607, SEQ ID No. 625, SEQ ID No. 662, SEQ ID No. 669, SEQ ID No. 688, SEQ ID No. 690, 
SEQ ID No. 716, SEQ ID No. 773, SEQ ID No. 778, SEQ ID No. 781, SEQ ID No. 783, SEQ ID No. 
5 788, SEQ ID No. 817, SEQ ID No. 848, SEQ ID No. 851, SEQ ID No. 853, SEQ ID No. 857, SEQ ID 
No. 875, SEQ ID No. 877, SEQ ID No. 886, SEQ ID No. 898, SEQ ID No. 902, SEQ ID No. 923, 
SEQ ID No. 938, SEQ ID No. 976, SEQ ID No. 978, SEQ ID No. 990, SEQ ID No. 1005, SEQ ID No. 
1021, SEQ ID No. 1035, SEQ ID No. 1069, SEQ ID No. 1083, SEQ ID No. 1088, SEQ ID No. 1089, 
SEQ ID No. 1091, SEQ ID No. 1092, SEQ ID No. 1095, SEQ ID No. 1096, SEQ ID No. 1 100, SEQ 

10 ID No. 1 105, SEQ ID No. 1 108, SEQ ID No. 1 1 17, SEQ ID No. 1 120, SEQ ID No. 1 121, SEQ ID No. 
1124, SEQ ID No. 1128, SEQ ID No. 1133, SEQ ID No. 1135, SEQ ID No. 1139, SEQ ID No. 1140, 
SEQ ID No. 1157, SEQ ID No. 1159, SEQ ID No. 1163, SEQ ID No. 1165, SEQ ID No. 1167, SEQ 
ID No. 1168, SEQ ID No. 1169, SEQ ID No. 1171, SEQ ID No. 1173, SEQ ID No. 1174, SEQ ID No. 
1177, SEQ ID No. 1180, SEQ ID No. 1181, SEQ ID No. 1186, SEQ ID No. 1194, SEQ ID No. 1197, 

1 5 and their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis polypeptide involved in lipopolysaccharide (LPS) biosynthesis, and in 
that it is chosen from the polypeptides having the following sequences: SEQ ID No. 17, SEQ ID No. 
201, SEQ ID No. 691, SEQ ID No. 807, SEQ ID No. 936, SEQ ID No. 983, SEQ ID No. 1019, SEQ 

20 ID No. 1077, and their representative fragments. 

Preferably, the invention relates to additional LPS-related polypeptides according to 
the invention, in that it is: 

(a) a Chlamydia trachomatis KDO (3-deoxy-D-manno-octylosonic acid)-related 
polypeptide or one of its representative fragments, and in that it is chosen from the polypeptides 

25 having the following sequences: SEQ ID No. 41, SEQ ID No. 242, SEQ ID No. 269, SEQ ID No. 772, 
and one of their representative fragments; 

(b) a Chlamydia trachomatis phosphomannomutase-related polypeptide or one of 
its representative fragments, and in that it is chosen from the polypeptides having the following 
sequence: SEQ ID No. 139, and its representative fragments; 

30 ( c ) a Chlamydia trachomatis phosphoglucomutase-related polypeptide or one of 

its representative fragments, and in that it is chosen from the polypeptides having the following 
sequence: SEQ ID No. 567 and its representative fragments; and 

(d) a Chlamydia trachomatis lipid A component-related polypeptide or one of its 
representative fragments, and in that it is chosen from the polypeptides having the following 

35 sequences: SEQ ID No. 4, SEQ ID No. 933, SEQ ID No. 934, SEQ ID No. 935, SEQ ID No. 1185, 
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and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis polypeptide or one of its representative fragments that contains an RGD 
sequence and is also an outer membrane protein, and in that it is chosen from the polypeptides having 
5 the following sequences: SEQ. ID No. 488, SEQ ID No. 489, SEQ ID No. 57 1 , SEQ ID No. 572, SEQ 
No. 573, SEQ ID No. 716 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis polypeptide or one of its representative fragments that is cysteine-rich and 
contains RGD sequence, and in that it is chosen from the polypeptides having the following sequence: 
10 SEQ ID No. 144 and one of its representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis outer membrane polypeptide that contains cysteines in their first 30 amino 
acids and also contain an RGD sequence, and in that it is chosen from the polypeptides having the 
following sequences: SEQ ID No. 101, SEQ ID No. 122, SEQ ID No. 308, SEQ ID No. 488, SEQ ID 
15 No. 489, SEQ ID No. 571, SEQ ID No. 572, SEQ ID No. 573, SEQ ID No. 651, SEQ ID No. 679, 
SEQ ID No. 680, SEQ ID No. 705, SEQ ID No. 716, SEQ ID No. 763, SEQ ID No. 870, SEQ ID No. 
878, SEQ ID No. 879, SEQ ID No. 995, SEQ ID No. 1028, SEQ ID No. 1029, SEQ ID No. 1176, and 
one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
20 is a Chlamydia trachomatis polypeptide or one of its representative fragments that contains RGD 
sequences homologous to Chlamydia pneumoniae polypeptides containing RGD sequences, and in 
that it is chosen from the polypeptides having the following sequences: SEQ ID No. 28, SEQ ID No. 
101, SEQ ID No. 125, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 286, SEQ ID No. 571, SEQ ID 
No. 572, SEQ ID No. 573, SEQ ID No. 763, SEQ ID No. 870, and one of their representative 
25 fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis Type HI or non-Type III secreted polypeptide or one of its representative 
fragments, and in that it is chosen from the polypeptides having the following sequences: 
SEQ ID No. 180, SEQ ID No. 181, SEQ ID No. 207, SEQ ID No. 208, SEQ ID No. 372, SEQ ID No. 

30 391, SEQ ID No. 399, SEQ ID No. 477, SEQ ID No. 486, SEQ ID No. 749, SEQ ID No. 758, SEQ ID 
No. 819, SEQ ID No. 878, SEQ ID No. 888, SEQ ID No. 896, SEQ ID No. 897, SEQ ID No. 900, 
SEQ ID No. 902, SEQ ID No. 923, SEQ ID No. 1015, SEQ ID No. 1018, SEQ ID No. 1059, SEQ ID 
No. 1060, SEQ ID No. 1069, SEQ ID No. 1071, SEQ ID No. 1073, SEQ ID No. 1076, SEQ ID No. 
1 189, and their representative fragments. 

35 Preferably, the invention relates to a polypeptide according to the invention, in that it 
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is a Chlamydia trachomatis cell wall anchored surface polypeptide or one of its representative 

fragments, and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 662, SEQ ID No. 681, SEQ ID No. 1182, SEQ ID No. 1192, and their representative 

fragments. 

5 Preferably, the invention relates to a polypeptide according to the invention, in that it 

is a Chlamydia trachomatis polypeptide or one of its representative fragments not found in Chlamydia 
pneumoniae (Blastp P>e" 10 ) and in that it is chosen from the polypeptides having the following 
sequences: SEQ ID No.2, SEQ ID No. 18, SEQ ID No. 60, SEQ ID No. 66, SEQ ID No. 67, SEQ ID 
No.68, SEQ ID No. 69, SEQ ID No. 70, SEQ ID No. 81, SEQ ID No. 89, SEQ ID No. 107, SEQ ID 

10 No.108, SEQ ID No. 109, SEQ ID No.134, SEQ ID No. 147, SEQ ID No.191, SEQ ID No. 194, SEQ 
ID No. 216, SEQ ID No. 217, SEQ ID No. 218, SEQ ID No. 219, SEQ ID No. 220, SEQ ID No. 221, 
SEQ ID No. 222, SEQ ID No. 222, SEQ ID No. 223, SEQ ID No. 224, SEQ ID No. 225, SEQ ID No. 
228, SEQ ID No. 235, SEQ ID No.257, SEQ ID No. 276, SEQ ID No. 277, SEQ ID No. 278, SEQ ID 
No. 279, SEQ ID No. 280, SEQ ID No. 281, SEQ ID No. 282, SEQ ID No.283, SEQ ID No. 284, 

15 SEQ ID No. 285, SEQ ID No. 289, SEQ ID No.291, SEQ ID No. 298, SEQ ID No. 284, SEQ ID No. 
313, SEQ ID No. 314, SEQ ID No. 315, SEQ ID No. 316, SEQ ID No. 334, SEQ ID No. 335, SEQ ID 
No. 336, SEQ ID No. 337, SEQ ID No. 338, SEQ ID No. 339, SEQ ID No. 340, SEQ ID No. 381, 
SEQ ID No. 393, SEQ ID No. 413, SEQ ID No. 418, SEQ ID No. 419, SEQ ID No. 419, SEQ ID No. 
420, SEQ ID No. 421, SEQ ID No. 422, SEQ ID No. 423, SEQ ID No. 436, SEQ ID No. 460, SEQ ID 

20 No. 475, SEQ ID No. 476, SEQ ID No. 480, SEQ ID No. 485, SEQ ID No. 487, SEQ ID No.49 1 , SEQ 
ID No. 492, SEQ ID No. 493, SEQ ID No. 494, SEQ ID No. 496, SEQ ID No. 500, SEQ ID No.504, 
SEQ ID No. 514, SEQ ID No. 527, SEQ ID No. 559, SEQ ID No.569, SEQ ID No. 570, SEQ ID No. 
575, SEQ ID No. 580, SEQ ID No. 582, SEQ ID No. 593, SEQ ID No. 598, SEQ ID No.632, SEQ ID 
No.640, SEQ ID No.651, SEQ ID No.671, SEQ ID No. 690, SEQ ID No. 694, ID No. 698, SEQ ID 

25 No. 710, SEQ ID No. 722, SEQ ID No. 723, SEQ ID No. 724, SEQ ID No. 770, SEQ ID No. 771, 
SEQ ID No.782, SEQ ID No. 783, SEQ ID No. 784, SEQ ID No. 790, SEQ ID No. 795, SEQ ID No. 
798, SEQ ID No. 805, SEQ ID No. 810, SEQ ID No. 817, SEQ ID No. 829, SEQ ID No. 830, SEQ ID 
No. 864, SEQ ID No. 866, SEQ ID No. 876, SEQ ID No. 887, SEQ ID No. 892, SEQ ID No. 899, 
SEQ ID No. 913, SEQ ID No. 921, SEQ ID No. 933, SEQ ID No. 938, SEQ ID No. 949, SEQ ID No. 

30 956, SEQ ID No. 1010, SEQ ID No. 1017, SEQ ID No. 1018, SEQ ID No. 1027, SEQ ID No. 1030, 
SEQ ID No. 1037, SEQ ID No. 1038, SEQ ID No. 1047, SEQ ID No. 1072, SEQ ID No. 1074, SEQ 
ID No. 1075, SEQ ID No. 1078, SEQ ID No. 1079, SEQ ID No. 1081, SEQ ID No. 1083, SEQ ID No. 
1084, SEQ ID No. 1087, SEQ ID No. 1088, SEQ ID No. 1089, SEQ ID No. 1091, SEQ ID No. 1092, 
SEQ ID No. 1094, SEQ ID No. 1095, SEQ ID No. 1096, SEQ ID No. 1098, SEQ ID No. 1104, SEQ 

35 ID No. 1 105, SEQ ID No. 1 106, SEQ ID No. 1 108, SEQ ID No. 1 1 10, SEQ ID No. 1 1 14, SEQ ID No. 



WO 99/28475 PCT/IB98/01939 
43 

1115, SEQIDNo. 1116, SEQ ID No. 1117, SEQ ID No. 1119, SEQ ID No. 1128, SEQ ID No. 1132, 
SEQ ID No. 1133, SEQ ID No. 1135, SEQ ID No. 1136, SEQ ID No. 1139, SEQ ID No. 1140, SEQ 
ID No. 1141, SEQ ID No. 1142, SEQIDNo. 1144, SEQ ID No. 1148, SEQIDNo. 1151, SEQ ID No. 
1155, SEQIDNo. 1157, SEQIDNo. 1159, SEQIDNo. 1161, SEQ ID No. 1162, SEQIDNo. 1165, 

5 SEQ ID No. 1166, SEQ ID No. 1167, SEQ ID No. 1168, SEQ ID No. 1169, SEQ ID No. 1171, SEQ 
ID No. 1 172, SEQ ID No. 1 173, SEQ ID No. 1 174, SEQ ID No. 1 175, SEQ ID No. 1 176, SEQ ID No. 
1177, SEQIDNo. 1178, SEQIDNo. 1180, SEQIDNo. 1 181, SEQ ID No. 1183, SEQIDNo. 1184, 
SEQ ID No. 1186, SEQ ID No. 1187, SEQ ID No. 1188, SEQ ID No. 1192, SEQ ID No. 1194, SEQ 
ED No. 1 197, and their representative fragments. 

0 Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the intermediate metabolism, in particular in the metabolism of sugars and/or of 
cofactors, and in that it is chosen from the polypeptides having the following sequences: 



SEQIDNo. 10; SEQ ID No. 44; SEQ ID No. 45; SEQ ID No. 46; SEQ ID No. 47; SEQ ID No. 93; 



15 


SEQIDNo. 101 


SEQ ID No. 102; 


SEQ ID No. 103 


SEQIDNo. 106; 


SEQIDNo. 107 




SEQIDNo. 120 


SEQIDNo. 121; 


SEQIDNo. 130 


SEQIDNo. 135; 


SEQ ID No. 140 




SEQ ID No. 143 


SEQ ID No. 144; 


SEQ ED No. 145 


SEQ ID No. 158; 


SEQ ID No. 159 




SEQ ID No. 160 


SEQIDNo. 161; 


SEQIDNo. 192 


SEQIDNo. 193; 


SEQ ID No. 196 




SEQ ID No. 197 


SEQIDNo. 198; 


SEQ ID No. 199 


SEQ ID No. 227; 


SEQ ID No. 229 


20 


SEQ ID No. 236 


SEQ ID No. 236; 


SEQ ID No. 239 


SEQ ID No. 243; 


SEQ ID No. 245 




SEQ ID No. 264 


SEQ ED No. 265; 


SEQ ID No. 297 


SEQIDNo. 331; 


SEQ ID No. 333 




SEQ ID No. 359 


SEQIDNo. 360; 


SEQ ID No. 374 


SEQ ID No. 404; 


SEQ ID No. 405 




SEQ ID No. 405 


SEQ ID No. 410; 


SEQ ID No. 415 


SEQ ID No. 415; 


SEQ ID No. 416 




SEQ ID No. 417 


SEQ ED No. 432; 


SEQ ID No. 460 


SEQIDNo. 461; 


SEQ ID No. 462 


25 


SEQ ID No. 495 


SEQ ID No. 513; 


SEQ ID No. 515 


SEQ ID No. 566; 


SEQ ID No. 566 




SEQ ID No. 566 


SEQIDNo. 589; 


SEQ ID No. 613 


SEQIDNo. 645; 


SEQ ID No. 646 




SEQ ID No. 647 


SEQ ID No. 652; 


SEQIDNo. 653 


SEQ ID No. 654; 


SEQ ID No. 672 




SEQ ID No. 673 


SEQ ID No. 674; 


SEQ ID No. 682 


SEQ ID No. 684; 


SEQ ID No. 692 




SEQ ID No. 700 


SEQ ID No. 725; 


SEQIDNo. 801 


SEQ ID No. 802; 


SEQ ID No. 835 


30 


SEQ ID No. 836 


SEQIDNo. 837; 


SEQIDNo. 860 


SEQIDNo. 861; 


SEQ ID No. 862 




SEQ ID No. 863 


SEQ ID No. 869; 


SEQ ID No. 869 


SEQ ID No. 925; 


SEQ ID No. 964 




SEQ ID No. 983 and one of their representative fragments. 







Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
35 which is involved in the intermediate metabolism of nucleotides or nucleic acids, and in that it is 
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chosen from the polypeptides having the following sequences: 



SEQ ID No. 142; 
SEQ ID No. 352; 
SEQ ID No. 529; 
SEQ ID No. 602; 



SEQ ID No. 169; 
SEQ ID No. 366; 
SEQ ID No. 530; 
SEQ ID No. 617: 



SEQ ID No. 256: 
SEQ ID No. 435: 
SEQ ID No. 548; 
SEQ ID No. 619; 



SEQ ID No. 268: 
SEQ ID No. 444: 
SEQ ID No. 549; 
SEQ ID No. 644; 



; SEQ ID No. 971 ; SEQ ID No. 972; SEQ ID No. 1023 and one of their representative 



SEQ ID No. 142; 
SEQ ID No. 325; 
SEQ ID No. 528; 
SEQ ID No. 601; 
SEQ ID No. 745; 
fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the metabolism of nucleic acids, and in that it is chosen from the polypeptides 
having the following sequences: 

SEQ ID No. 5; SEQ ID No. 12; SEQ ID No. 82; SEQ ID No. 96; SEQ ID No. 97; SEQ ID No. 



SEQ ID No. 99; 

SEQ ID No. 137; 
15 SEQ ID No. 260; 

SEQ ID No. 302; 

SEQ ID No. 444; 

SEQ ID No. 599; 

SEQ ID No. 625; 
20 SEQ ID No. 703; 

SEQ ID No. 767; 

SEQ ID No. 794; 

SEQ ID No. 907; 

SEQ ID No. 962; 



SEQIDNo. 100; SEQIDNo. 105; SEQIDNo. 118; 
SEQ ID No. 163; SEQ ID No. 190; SEQ ID No. 204; 
SEQ ID No. 262; SEQ ID No. 290; SEQ ID No. 300; 
SEQ ID No. 387; SEQ ID No. 427; SEQ ID No. 434; 
SEQ ID No. 471; SEQ ID No. 595; SEQ ID No. 596; 
SEQ ID No. 600; SEQ ID No. 605; SEQ ID No. 612; 
SEQ ID No. 650; SEQ ID No. 657; SEQ ID No. 658; 
SEQ ID No. 704; SEQ ID No. 708; SEQ ID No. 719: 
SEQ ID No. 775; SEQ ID No. 779; SEQ ID No. 787: 
SEQIDNo. 841; SEQ ID No. 842; SEQ ID No. 883; 
SEQ ID No. 918; SEQ ID No. 924; SEQ ID No. 928; 
SEQ ID No. 962; SEQ ID No. 963; SEQ ID No. 969; 
25 SEQ ID No. 975; SEQ ID No. 979; SEQ ID No. 995; SEQ ID No. 1031; SEQ ID No. 1032 and one of 
their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the metabolism of amino acids or polypeptides, and in that it is chosen from the 
30 polypeptides having the following sequences: 

SEQ ID No. 27; SEQ ID No. 41 ; SEQ ID No. 55; SEQ ID No. 56; SEQ ID No. 57; SEQ ID No. 59; 
SEQ ID No. 62; SEQ ID No. 63; SEQ ID No. 64; SEQ ID No. 65; SEQ ID No. 119; SEQ ID No. 132: 



SEQ ID No. 136 
SEQ ID No. 259: 
SEQ ID No. 301 
SEQ ID No. 441 
SEQ ID No. 597 
SEQ ID No. 624 
SEQ ID No. 702; 
SEQ ID No. 766: 
SEQ ID No. 788: 
SEQ ID No. 884: 
SEQ ID No. 929; 
SEQ ID No. 970; 



SEQ ID No. 240; 
SEQ ID No. 382; 
SEQIDNo. 447: 



SEQ ID No. 241 
SEQ ID No. 406; 
SEQ ID No. 453; 



SEQ ID No. 277; 
SEQ ED No. 428: 
SEQ ID No. 454; 



SEQIDNo. 278; 
SEQ ID No. 442 
SEQIDNo. 541 



SEQ ID No. 279; 
SEQ ID No. 446 
SEQ ID No. 542 
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SEQIDNo. 591 
SEQ ID No. 648 
SEQ ID No. 717 
SEQ ID No. 920: 



SEQ ID No. 608 
SEQ ID No. 649 
SEQ ID No. 765 
SEQ ID No. 922 



SEQ ID No. 609 
SEQ ID No. 660: 
SEQ ID No. 797; 
SEQIDNo. 937; 



SEQIDNo. 610; 
SEQIDNo. 661; 
SEQIDNo. 871; 
SEQIDNo. 998; 



SEQ ID No. 618 
SEQ ID No. 677 
SEQ ID No. 875 
SEQIDNo. 1020 



SEQIDNo. 1044; SEQ ID No. 1046; SEQIDNo. 1049 and 



30 



5 SEQ ID No. 1021; SEQ ID No. 1034 
one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the metabolism of polypeptides, and in that it is chosen from the polypeptides 

10 having the following sequences: 

SEQ ID No. 21; SEQ ID No. 22; SEQ ID No. 23; SEQ ID No. 24; SEQ ID No. 25; SEQ ID No. 26 
SEQ ID No. 75; SEQ ID No. 84; SEQ ID No. 86; SEQ ID No. 92; SEQ ID No. 133; SEQ ID No. 151 
SEQ ID No. 152; SEQ ID No. 157; SEQ ID No. 179; SEQ ID No. 209; SEQ ID No. 307 
SEQ ID No. 326; SEQ ID No. 343; SEQ ID No. 344; SEQ ID No. 345; SEQ ID No. 371 

15 SEQ ID No. 429; SEQ ID No. 519; SEQ ID No. 557; SEQ ID No. 586; SEQ ID No. 587 
SEQ ID No. 630; SEQ ID No. 656; SEQ ID No. 706; SEQ ID No. 707; SEQ ID No. 730 
SEQIDNo. 751; SEQ ID No. 752; SEQ ID No. 786; SEQ ID No. 847; SEQ ID No. 885. 
SEQ ID No. 923; SEQ ID No. 978; SEQ ID No. 1039; SEQIDNo. 1048 and one of their 
representative fragments. 

20 Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the metabolism of fatty acids, and in that it is chosen from the polypeptides 
having the following sequences: 

SEQ ID No. 4; SEQ ID No. 15; SEQ ID No. 16; SEQ ID No. 141; SEQ ID No. 173; SEQ ID No. 205 
25 SEQ ID No. 205; SEQ ID No. 206; SEQ ID No. 207; SEQ ID No. 208; SEQ ID No. 312 
SEQ ID No. 355; SEQ ID No. 415; SEQ ID No. 550; SEQ ID No. 558; SEQ ID No. 560 
SEQ ID No. 561; SEQ ID No. 574; SEQ ID No. 574; SEQ ID No. 577; SEQ ID No. 578 
SEQ ID No. 590; SEQ ID No. 614; SEQ ID No. 772; SEQ ID No. 808; SEQ ID No. 809 
SEQ ID No. 904; SEQ ID No. 905; SEQ ID No. 905; SEQ ID No. 933; SEQ ID No. 934; 
SEQ ID No. 934; SEQ ID No. 936 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the synthesis of the wall, and in that it is chosen from the polypeptides having the 
following sequences: 

SEQ ID No. 87; SEQIDNo. 196; SEQ ID No. 242; SEQ ID No. 269; SEQ ID No. 628; 
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SEQIDNo.629; SEQ ID No. 634; SEQIDNo.635; SEQIDNo.637; SEQIDNo.638; 
SEQ ID No. 1019 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the transcription, translation and/or maturation process, and in that it is chosen 
from the polypeptides having the following sequences: 



SEQ ID No. 112; SEQ ID No. 113 

SEQ ID No. 350; SEQ ID No. 362 

SEQ ID No. 45 1 ; SEQ ID No. 546 

SEQ ID No. 833; SEQ ID No. 868 



15 



SEQ ID No. 332; SEQ ID No. 212; SEQ ID No. 213 
SEQ ID No. 363; SEQ ID No. 364; SEQ ID No. 407 
SEQ ID No. 643; SEQ ID No. 744; SEQ ID No. 746 
SEQ ID No. 981; SEQ ID No. 982; SEQ ID No. 1003 
SEQ ID No. 101 1 ; SEQ ID No. 1042 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis ribosomal polypeptide or one of its representative 
fragments, and in that it is chosen from the polypeptides having the following sequences: 
SEQ ID No. 114; SEQ ID No. 115; SEQ ID No. 116; SEQ ID No. 328; SEQ ID No. 361 
SEQ ID No. 543; 
SEQ ID No. 941; 
SEQ ID No. 948; 
SEQ ID No. 954; 
SEQ ID No. 960; 



SEQ ID No. 445; 

SEQ ID No. 813; 

SEQ ID No. 947; 

SEQ ID No. 953; 
SEQ ID No. 958; 
SEQ ID No. 1043; 



SEQ ID No. 584; 
SEQ ID No. 942; 
SEQ ID No. 950; 
SEQ ID No. 955; 
SEQ ID No. 961; 



SEQ ID No. 585 
SEQ ID No. 944 
SEQ ID No. 951 
SEQ ID No. 955 
SEQ ID No. 1040 



SEQ ID No. 375; 
SEQ ID No. 743; 
SEQ ID No. 946; 
SEQ ID No. 952; 
20 SEQ ID No. 957; 
SEQ ID No. 1041 
fragments. 

Preferably, the invention also relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis transport polypeptide or one of its representative 
25 fragments, and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 6; SEQ ID No. 50; SEQ ID No. 51; SEQ ID No. 80; SEQ ID No. 125; SEQ ID No. 126 



3 ID No. 1063; SEQ ID No. 1064 and one of their 



SEQ ID No. 128 
SEQ ID No. 249; 
SEQ ID No. 271 

30 SEQ ID No. 324: 
SEQ ID No. 512 
SEQ ID No. 680 
SEQ ID No. 715 
SEQ ID No. 738: 

35 SEQ ID No. 987 



SEQ ID No. 129 
SEQ ID No. 251 
SEQ ID No. 275 
SEQ ID No. 398; 
SEQ ID No. 564 
SEQ ID No. 711 
SEQ ID No. 730; 
SEQ ID No. 870 
SEQ ID No. 988 



SEQ ID No. 215 
SEQ ID No. 252: 
SEQ ID No. 293 
SEQ ID No. 401 
SEQ ID No. 565 
SEQ ID No. 712 
SEQ ID No. 731 
SEQ ID No. 908; 
SEQ ID No. 992 



SEQ ID No. 246 
SEQ ID No. 253 
SEQ ID No. 309 
SEQ ID No. 449; 
SEQ ID No. 667 
SEQ ID No. 713 
SEQ ID No. 736: 
SEQ ID No. 919 
SEQ ID No. 993 



SEQ ID No. 248 
SEQ ID No. 255 
SEQ ID No. 323: 
SEQ ID No. 511; 
SEQ ID No. 679 
SEQ ID No. 714 
SEQ ID No. 737 
SEQ ID No. 977 
SEQ ID No. 994: 
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SEQ ID No. 1028; SEQ ID No. 1029 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the virulence process, and in that it is chosen from the polypeptides having the 
5 following sequences: 

SEQ ID No. 20; SEQ ID No. 815; SEQ ID No. 816; SEQ ID No. 898; SEQ ID No. 1059; 
SEQ ID No. 1060 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
10 which is involved in the secretory system and/or which is secreted, and in that it is chosen from the 
polypeptides having the following sequences: 

SEQ ID No. 758; SEQ ID No. 888; SEQ ID No. 889; SEQ ID No. 890; SEQ ID No. 891; 
SEQ ID No. 896; SEQ ID No. 897; SEQ ID No. 898 and one of their representative fragments. 

The secreted polypeptides, including the Type IH and other, non-Type III secreted 

15 polypeptides, of the present invention, as well as the corresponding nucleotide sequences, may be 
detected by techniques known to persons skilled in the art, such as for example the techniques using 
cloning combined with vectors allowing the expression of the said polypeptides fused to export 
markers such as the luc gene for luciferase or the PhoA gene for alkaline phosphatase. 

Preferably, the invention relates to a polypeptide according to the invention, 

20 characterized in that it is a polypeptide specific to Chlamydiae or one of its representative fragments, 
and in that it is chosen from the polypeptides having the following sequences: 



SEQ ID No. 22; SEQ ID No. 29; SEQ ID No. 31; SEQ ID No. 32; SEQ ID No. 34; SEQ ID No. 35; 
SEQ ID No. 39; SEQ ID No. 40; SEQ ID No. 43; SEQ ID No. 48; SEQ ID No. 49; SEQ ID No. 50; 
SEQ ID No. 52; SEQ ID No. 53; SEQ ID No. 54; SEQ ID No. 72; SEQ ID No. 77; SEQ ID No. 78; 



25 


SEQ ID No. 87; 


SEQ ID No. 90; 


SEQ ID No. 95; 


SEQ ID No. 


108; 


SEQ ID No 


110 




SEQ ID No. Ill; 


SEQ ID No. 122 


SEQ ID No. 123 


SEQ ID No 


124; 


SEQ ID No 


127 




SEQ ID No. 138; 


SEQ ID No. 144 


SEQ ID No. 146 


SEQ ID No 


153; 


SEQ ID No 


155 




SEQ ID No. 164; 


SEQ ID No. 166 


SEQ ID No. 175 


SEQ ID No 


182; 


SEQ ID No 


184 




SEQ ID No. 186; 


SEQ ID No. 187 


SEQ ID No. 188 


SEQ ID No 


202; 


SEQ ID No 


210 


30 


SEQ ID No. 247; 


SEQ ID No. 258 


SEQ ID No. 266 


SEQ ID No 


267; 


SEQ ID No 


270 




SEQ ID No. 273; 


SEQ ID No. 274 


SEQ ID No. 295 


SEQ ID No 


296; 


SEQ ID No 


305- 




SEQ ID No. 306; 


SEQ ID No. 309 


SEQ ID No. 318 


SEQ ID No 


319; 


SEQ ID No 


322; 




SEQ ID No. 326; 


SEQ ID No. 342 


SEQ ID No. 357 


SEQ ID No 


376; 


SEQ ID No 


379; 




SEQ ID No. 380; 


SEQ ID No. 388 


SEQ ID No. 390 


SEQ ID No 


400; 


SEQ ID No 


431; 


35 


SEQ ID No. 433; 


SEQ ID No. 438 


SEQ ID No. 443 


SEQ ID No. 456; 


SEQ ID No 


457; 
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SEQIDNo.464; SEQIDNo.468; SEQIDNo.470; SEQIDNo.473; 
SEQIDNo.489; SEQIDNo.497; SEQIDNo.501; SEQ ID No. 503: 
SEQIDNo.508; SEQIDNo.512; SEQIDNo.521; SEQ ID No. 522; 
SEQIDNo.524; SEQIDNo.533; SEQIDNo.535; SEQIDNo.536; 
SEQIDNo.538; SEQ ID No. 539; SEQ ID No. 540; SEQIDNo.554: 
SEQIDNo.572; SEQ ID No. 579; SEQ ID No. 595; SEQ ID No. 603: 
SEQ ID No. 606; SEQ ID No. 607; SEQ ID No. 615; SEQ ID No. 616: 
SEQ ID No. 641; SEQ ID No. 642; SEQ ID No. 659; SEQ ID No. 668: 
SEQ ID No. 693; SEQ ID No. 695; SEQ ID No. 696; SEQ ID No. 699; 
SEQ ID No. 704; SEQ ID No. 716; SEQ ID No. 726; SEQ ID No. 728; 
SEQ ID No. 742; SEQ ID No. 747; SEQ ID No. 750; SEQ ID No. 751: 
SEQ ID No. 757; SEQ ID No. 759; SEQ ID No. 761; SEQ ID No. 762; 
SEQ ID No. 764; SEQ ID No. 773; SEQ ID No. 780; SEQ ID No. 781 
SEQ ID No. 800; SEQ ID No. 803; SEQ ID No. 804; SEQ ID No. 818; 
SEQ ID No. 822; SEQ ID No. 823; SEQ ID No. 824; SEQ ID No. 827; 
SEQ ID No. 839; SEQ ID No. 849; SEQ ID No. 850; SEQ ID No. 851 
SEQ ID No. 855; SEQ ID No. 856; SEQ ID No. 857; SEQ ID No. 858; 
SEQ ID No. 860; SEQ ID No. 861; SEQ ID No. 862; SEQ ID No. 863; 
SEQ ID No. 868; SEQ ID No. 869; SEQ ID No. 870; SEQ ID No. 871 
SEQ ID No. 873; SEQ ID No. 874; SEQ ID No. 875; SEQ ID No. 877: 
SEQ ID No. 880; SEQ ID No. 882; SEQ ID No. 884; SEQ ID No. 886; 
SEQ ID No. 901; SEQ ID No. 906; SEQ ID No. 910; SEQ ID No. 912: 
SEQ ID No. 916; SEQ ID No. 917; SEQ ID No. 926; SEQ ID No. 929: 
SEQ ID No. 965; SEQ ID No. 967; SEQ ID No. 968; SEQ ID No. 984: 
SEQ ID No. 989; SEQ ID No. 990; SEQ ID No. 996; SEQ ID No. 997: 
SEQ ID No. 1002; SEQ ID No. 1013; SEQ ID No. 1016; SEQ ID No. 1031 
SEQ ID No. 1035; SEQ ID No. 1049; SEQ ID No. 1051; SEQ ID No. 1052: 
SEQ ID No. 1057; SEQ ID No. 1058; SEQ ID No. 1062; 
SEQ ID No. 1070; SEQ ID No. 107 1 ; SEQ ID No. 1073 and one of their representative fragments. 

In general, in the present invention, the functional group to which a polypeptide of the 
invention belongs, as well as its corresponding nucleotide sequence, may be determined either by 
comparative analogy with sequences already known, or by the use of standard techniques of 
biochemistry, of cytology combined with the techniques of genetic engineering such as 
immunoaffinity, localization by immunolabelling, differential extraction, measurement of enzymatic 
35 activity, study of the activity inducing or repressing expression or the study of expression in E. coli. 



SEQ ID No. 458; 

SEQ ID No. 486; 

SEQ ID No. 504; 

SEQ ID No. 523; 
5 SEQ ID No. 537; 

SEQ ID No. 563; 

SEQ ID No. 604; 

SEQ ID No. 622; 

SEQ ID No. 670; 
10 SEQ ID No. 703; 

SEQ ID No. 739; 

SEQ ID No. 755; 

SEQ ID No. 763; 

SEQ ID No. 789; 
15 SEQ ID No. 820; 

SEQ ID No. 828; 

SEQ ID No. 852; 

SEQ ID No. 859; 

SEQ ID No. 865; 
20 SEQ ID No. 872; 

SEQ ID No. 878; 

SEQ ID No. 893; 

SEQ ID No. 915; 

SEQ ID No. 933; 
25 SEQ ID No. 986; 

SEQ ID No. 1001; 

SEQ ID No. 1033; 

SEQ ID No. 1 054; SEQ ID No. 1056; 

30 
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It is clearly understood, on the one hand, that, in the present invention, the nucleotide 
sequences (ORF) and the amino acid sequences (SEQ ID No. 2 to SEQ ID No. 1197 ) which are listed 
by functional group, are not exhaustive within the group considered. Moreover, it is also clearly 
understood that, in the present invention, a nucleotide sequence (ORF) or an amino acid sequence 
5 mentioned within a given functional group may also be part of another group taking into account, for 
example, the interrelationship between the groups listed. Accordingly, and as an example of this 
interrelationship, an exported and/or secreted polypeptide as well as its coding nucleotide sequence 
may also be involved in the Chlamydia trachomatis virulence process by modifying the defense 
mechanism of the infected host cell, or a transmembrane polypeptide or its coding nucleotide 
10 sequence is also part of the polypeptides or coding nucleotide sequences of the cellular envelope. 

The subject of the present invention is also the nucleotide and/or polypeptide 
sequences according to the invention, characterized in that the said sequences are recorded on a 
medium, called recording medium, whose type and nature facilitate the reading, the analysis and the 
exploitation of the said sequences. These media may of course also contain other information 
15 extracted from the present invention, such as in particular the analogies with already known 
sequences, such as those mentioned in Table 1 of the present description, and/or may contain, in 
addition, information relating to the nucleotide and/or polypeptide sequences of other microorganisms 
so as to facilitate the comparative analysis and the exploitation of the results obtained. 

Among these recording media, computer-readable media, such as magnetic, optical, 
20 electrical and hybrid media such as, for example, floppy disks, CD-ROMs or recording cassettes, are 
preferred in particular. 

The invention also relates to nucleotide sequences which can be used as primer or 
probe, characterized in that the said sequences are chosen from the nucleotide sequences according to 
the invention. 

25 The invention relates, in addition, to the use of a nucleotide sequence according to the 

invention, as primer or probe, for the detection and/or amplification of nucleic acid sequences. 

The nucleotide sequences according to the invention may thus be used to amplify 
nucleotide sequences, in particular by the PCR technique (polymerase chain reaction) (Erlich, 1989; 
Innis et al., 1990; Rolfs et al., 1991, and White et al., 1997). 

30 These oligodeoxyribonucleotide or oligoribonucleotide primers correspond to 

representative nucleotide fragments, and are advantageously at least 8 nucleotides, preferably at least 
12 nucleotides, 15 nucleotides and still more preferably at least 20 nucleotides long. 

Other techniques for amplifying the target nucleic acid may be advantageously used 
as alternatives to PCR. 

35 The nucleotide sequences of the invention, in particular the primers according to the 
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invention, may also be used in other methods for amplifying a target nucleic acid, such as: 

the TAS (Transcription-based Amplification System) technique described by Kwoh et al. in 

1989; 

the 3SR (Self-Sustained Sequence Replication) technique described by Guatelli et al. in 1990; 
the NASBA (Nucleic Acid Sequence Based Amplification) technique described by Kievitis 
etal. in 1991; 

the SDA (Strand Displacement Amplification) technique (Walker et al., 1992); 

the TMA (Transcription Mediated Amplification) technique. 

The polynucleotides of the invention may also be used in techniques for amplifying 
or for modifying the nucleic acid serving as probe, such as: 

the LCR (Ligase Chain Reaction) technique described by Landegren etal. in 1988 and 
perfected by Barany et al. in 1991, which uses a thermostable ligase; 

the RCR (Repair Chain Reaction) technique described by Segev in 1992; 

the CPR (Cycling Probe Reaction) technique described by Duck et al. in 1990; 

the Q-beta-replicase amplification technique described by Miele et al. in 1983 and perfected 
in particular by Chu et al. in 1986, Lizardi et al. in 1988, and then by Burg et al. as well as by Stone 
et al. in 1996. 

The invention also relates to the nucleotide sequences of fragments which can be 
obtained by amplification with the aid of at least one primer according to the invention. The present 
invention encompasses both hybridization probes and primers. In general, the complementary probes 
should be of the length sufficient to form a stable hybrid complex with the target sequences. Primers, 
while complementary to the target sequences need not form stable hybridization complexes with the 
target sequences alone. Rather, primers form stable complexes with the target sequences in the 
presence of polymerase to permit extension of the primer. 

In the case where the target polynucleotide to be detected is possibly an RNA, for 
example an mRNA, it will be possible to use, prior to the use of an amplification reaction with the aid 
of at least one primer according to the invention or to the use of a method of detection with the aid of 
at least one probe of the invention, a reverse transcriptase-type enzyme so as to obtain a cDNA from 
the RNA contained in the biological sample. The cDNA obtained will then serve as target for the 
primer(s) or the probe(s) used in the amplification or detection method according to the invention. 

The detection probe will be chosen so that it hybridizes with the target sequence or 
the amplicon generated from the target sequence. Such a detection probe will advantageously have as 
sequence a sequence of at least 12 nucleotides, 15 nucleotides, in particular of at least 20 nucleotides, 
and preferably at least 100 nucleotides. 

The invention also comprises the nucleotide sequences which can be used as probe or 
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primer according to the invention, characterized in that they are labelled with a radioactive compound 
or with a nonradioactive compound. 

The nonlabelled nucleotide sequences may be used directly as probes or primers; 
however, the sequences are generally labelled with a radioactive element ( 32 P, 35 S, 3 H, 125 I) or with a 
5 nonradioactive molecule (biotin, acetylaminofluorene, digoxigenin, 5-bromo-deoxyuridine, 
fluorescein) so as to obtain probes which can be used in numerous applications. 

Examples of nonradioactive labelling of nucleotide sequences are described, for 
example, in French patent No. 78,10975 or by Urdea et al. or by Sanchez-Pescador et al. in 1988. 

In the latter case, one of the labelling methods described in patents FR-2 422 956 and 
10 FR-2 518 755 may also be used. 

The invention also relates to the nucleotide sequences of fragments which can be 
obtained by hybridization with the aid of at least one probe according to the invention. 

The hybridization technique may be performed in various ways (Matthews et al., 
1988). The most common method consists in immobilizing the nucleic acid extracted from C. 
15 trachomatis cells on a support (such as nitrocellulose, nylon, polystyrene) and in incubating, under 
well-defined conditions, the target nucleic acid immobilized with the probe. After hybridization, the 
excess probe is removed and the hybrid molecules formed are detected by the appropriate method 
(measurement of the radioactivity, of the fluorescence or of the enzymatic activity linked to the 
probe). 

20 The invention also comprises the nucleotide sequences according to the invention, 

characterized in that they are covalently or noncovalently immobilized on a support. 

According to another advantageous embodiment of the nucleic sequences according 
to the invention, the latter may be used immobilized on a support and may thus serve to capture, 
through specific hybridization, the target nucleic acid obtained from the biological sample to be 

25 tested. If necessary, the solid support is separated from the sample and the hybridization complex 
formed between the so-called capture probe and the target nucleic acid is then detected by means of a 
second probe, called detection probe, labelled with an easily detectable element. 

The nucleotide sequences according to the invention may also be used in new 
analytical systems, DNA chips, which allow sequencing, the study of mutations and of the expression 

30 of genes, and which are currently of interest given their very small size and their high capacity in 
terms of number of analyses. 

The principle of the operation of these chips is based on molecular probes, most often 
oligonucleotides, which are attached onto a miniaturized surface, generally of the order of a few 
square centimetres. During an analysis, a sample containing fragments of a target nucleic acid to be 

35 analysed, for example DNA or RNA labelled, for example, after amplification, is deposited onto the 
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DNA chip in which the support has been coated beforehand with probes. Bringing the labelled target 
sequences into contact with the probes leads to the formation, through hybridization, of a duplex 
according to the rule of pairing defined by J.D. Watson and F. Crick. After a washing step, analysis of 
the surface of the chip allows the effective hybridizations to be located by means of the signals 
5 emitted by the labels tagging the target. A hybridization fingerprint results from this analysis which, 
by appropriate computer processing, will make it possible to determine information such as the 
presence of specific fragments in the sample, the determination of sequences and the presence of 
mutations. 

The chip consists of a multitude of molecular probes, precisely organized or arrayed 
10 on a solid support whose surface is miniaturized. It is at the centre of a system where other elements 
(imaging system, microcomputer) allow the acquisition and interpretation of a hybridization 
fingerprint. 

The hybridization supports are provided in the form of flat or porous surfaces 
(pierced with wells) composed of various materials. The choice of a support is determined by its 

15 physicochemical properties, or more precisely, by the relationship between the latter and the 
conditions under which the support will be placed during the synthesis or the attachment of the probes 
or during the use of the chip. It is therefore necessary, before considering the use of a particular 
support (R.S. Matson et al., 1994), to consider characteristics such as its stability to pH, its physical 
strength, its reactivity and its chemical stability as well as its capacity to nonspecifically bind nucleic 

20 acids. Materials such as glass, silicon and polymers are commonly used. Their surface is, in a first 
step, called «functionalization», made reactive towards the groups which it is desired to attach 
thereon. After the functionalization, so-called spacer molecules are grafted onto the activated surface. 
Used as intermediates between the surface and the probe, these molecules of variable size render 
unimportant the surface properties of the supports, which often prove to be problematic for the 

25 synthesis or the attachment of the probes and for the hybridization. 

Among the hybridization supports, there may be mentioned glass which is used, for 
example, in the method of in situ synthesis of oligonucleotides by photochemical addressing 
developed by the company Affymetrix (E.L. Sheldon, 1993), the glass surface being activated by 
silane. Genosensor Consortium (P. Merel, 1994) also uses glass slides carrying wells 3 mm apart, this 

30 support being activated with epoxysilane. 

Polymers or silicon may also be mentioned among these hybridization supports. For 
example, the Andrein Mirzabekov team has developed a chip consisting of polyacrylamide squares 
polymerized on a silanized glass surface (G. Yershov etal., 1996). Several teams use silicon, in 
particular the IFOS laboratory of Ecole Centrale of Lyon which uses a silicon semiconductor substrate 

35 which is p-doped by introducing it into its crystalline structure atoms whose valency is different from 
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that of silicon. Various types of metals, in particular gold and platinum, may also be used as support 
(Genosensor Consortium (K. Beattie et al., 1993)). 

The probes according to the invention may be synthesized directly in situ on the 
supports of the DNA chips. This in situ synthesis may be carried out by photochemical addressing 
5 (developed by the company Affymax (Amsterdam, Holland) and exploited industrially by its 
subsidiary Affymetrix (United States)) or based on the VLSIPS (very large scale immobilized 
polymer synthesis) technology (S.P.A. Fodor etal., 1991) which is based on a method of 
photochemically directed combinatory synthesis and the principle of which combines solid-phase 
chemistry, the use of photolabile protecting groups and photolithography. 
10 The probes according to the invention may be attached to the DNA chips in various 

ways such as electrochemical addressing, automated addressing or the use of probe printers 
(T. Livache et al., 1994; G. Yershov et al., 1996; J. Derisi et al., 1996, and S. Borman, 1996). 

The revealing of the hybridization between the probes of the invention, deposited or 
synthesized in situ on the supports of the DNA chips, and the sample to be analysed, may be 
15 determined, for example, by measurement of fluorescent signals, by radioactive counting or by 
electronic detection. 

The use of fluorescent molecules such as fluorescein constitutes the most common 
method of labelling the samples. It allows direct or indirect revealing of the hybridization and allows 
the use of various fluorochromes. 

20 Affymetrix currently provides an apparatus or a scanner designed to read its Gene 

Chip? chips. It makes it possible to detect the hybridizations by scanning the surface of the chip in 
confocal microscopy (R J. Lipshutz et al., 1995). Other methods of detecting fluorescent signals have 
been tested: coupling of an epifluorescence microscope and a CCD camera (G. Yershov et al., 1996), 
the use of an optical fibre collecting system (E.L. Sheldon, 1993). A conventional method consists in 

25 carrying out an end labelling, with phosphorus 32, of the target sequences, by means of an appropriate 
apparatus, the Phosphorimager (marketed by Molecular Dynamics). The electronic detection is based 
on the principle that the hybridization of two nucleic acid molecules is accompanied by physical 
phenomena which can be quantified under certain conditions (system developed by Ecole Centrale of 
Lyon and called GEN-FET (GEN field effect transistor)). Genosensor Consortium and the company 

30 Beckman Instruments who are developing an electronic chip or Permittivity Chips? may also be 
mentioned (K. Beattie et al., 1993). 

The nucleotide sequences according to the invention may thus be used in DNA chips 
to carry out the analysis of mutations. This analysis is based on the production of chips capable of 
analysing each base of a nucleotide sequence according to the invention. It is possible, in particular to 

35 this end, to use the microsequencing techniques on a DNA chip. The mutations are detected by 
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extending immobilized primers which hybridize to the template of sequences analysed, just at the 
position adjacent to that of the mutated nucleotide to be detected. A single-stranded template, RNA or 
DNA, of the sequences to be analysed will be advantageously prepared according to conventional 
methods, from products amplified according to PCR-type techniques. The templates of single- 
5 stranded DNA, or of RNA thus obtained are then deposited on the DNA chip, under conditions 
allowing their specific hybridization to the immobilized primers. A thermostable polymerase, for 
example Tth or T7 DNA polymerase, specifically extends the 3 ' end of the immobilized primer with 
a labelled nucleotide analogue complementary to the nucleotide at the position of the variable site. 
For example a thermal cycling is performed in the presence of fluorescent dideoxyribonucleotides. 

10 The experimental conditions will be adapted in particular to the chips used, to the immobilized 
primers, to the polymerases used and to the labelling system chosen. One advantage of 
microsequencing, compared with techniques based on the hybridization of probes, is that it makes it 
possible to identify all the variable nucleotides with optimal discrimination under homogeneous 
reaction conditions; used on DNA chips, it allows optimal resolution and specificity for the routine 

15 and industrial detection of mutations in multiplex. 

The nucleotide sequences according to the invention may also be used in DNA chips 
to carry out the analysis of the expression of the Chlamydia trachomatis genes. This analysis of the 
expression of Chlamydia trachomatis genes is based on the use of chips where probes of the 
invention, chosen for their specificity to characterize a given gene, are present (D.J. Lockhart et al., 

20 1996; D.D. Shoemaker et al, 1996). For the methods of analysis of gene expression using the DNA 
chips, reference may, for example, be made to the methods described by D.J. Lockhart et al. (1996) 
and Sosnowsky etal. (1997) for the synthesis of probes in situ or for the addressing and the 
attachment of previously synthesized probes. The target sequences to be analysed are labelled and in 
general fragmented into sequences of about 50 to 100 nucleotides before being hybridized onto the 

25 chip. After washing as described, for example, by D.J. Lockhart et al. (1996) and application of 
different electric fields (Sosnowsky et al., 1997), the labelled compounds are detected and quantified, 
the hybridizations being carried out at least in duplicate. Comparative analyses of the signal 
intensities obtained with respect to the same probe for different samples and/or for different probes 
with the same sample, determine the differential expression of RNA or of DNA derived from the 

30 sample. 

The nucleotide sequences according to the invention may, in addition, be used in 
DNA chips where other nucleotide probes specific for other microorganisms are also present, and may 
allow the carrying out of a serial test allowing rapid identification of the presence of a microorganism 
in a sample. 

35 Accordingly, the subject of the invention is also the nucleotide sequences according 
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to the invention, characterized in that they are immobilized on a support of a DNA chip. 

The DNA chips, characterized in that they contain at least one nucleotide sequence 
according to the invention, immobilized on the support of the said chip, also form part of the 
invention. 

5 The said chips will preferably contain several probes or nucleotide sequences of the 

invention of different length and/or corresponding to different genes so as to identify, with greater 
certainty, the specificity of the target sequences or the desired mutation in the sample to be analysed. 

Accordingly, the analyses carried out by means of primers and/or probes according to 
the invention, immobilized on supports such as DNA chips, will make it possible, for example, to 
10 identify, in samples, mutations linked to variations such as intraspecies variations. These variations 
may be correlated or associated with pathologies specific to the variant identified and will make it 
possible to select the appropriate treatment. 

The invention thus comprises a DNA chip according to the invention, characterized in 
that it contains, in addition, at least one nucleotide sequence of a microorganism different from 
15 Chlamydia trachomatis, immobilized on the support of the said chip; preferably, the different 
microorganism will be chosen from an associated microorganism, a bacterium of the Chlamydia 
family, and a variant of the species Chlamydia trachomatis. 

Another subject of the present invention is a vector for the cloning and/or the 
expression of a sequence, characterized in that it contains a nucleotide sequence according to the 
20 invention. 

Among the said vectors according to the invention, the vectors containing a 
nucleotide sequence encoding a polypeptide of the cellular, preferably outer, envelope of Chlamydia 
trachomatis or one of its representative fragments, are preferred. 

In a specific embodiment, the vectors contain a nucleotide sequence encoding a 

25 Chlamydia trachomatis secreted polypeptide or one of its representative fragments or encoding a 
transport polypeptide, a surface exposed polypeptide, a lipoprotein or one of its representative 
fragments, a polypeptide involved in lipopolysaccharide (LPS) biosynthesis, a Type III or non-Type 
m secreted polypeptide, a polypeptide containing RGD attachment sites, a cell wall anchored surface 
polypeptide, a polypeptide not found in Chlamydia pneumoniae, a ribosomal polypeptide or a 

30 polypeptide involved in secretion, transcription, translation, maturation of proteins, a polypeptide 
involved in the synthesis of the wall, a polypeptide involved in the virulence, a polypeptide involved 
in the intermediate metabolism, in particular in the metabolism of sugars and/or of cofactors, a 
polypeptide involved in the metabolism of nucleotides, of amino acids, of nucleic acids or of fatty 
acids of Chlamydia trachomatis or one of their representative fragments, or a polypeptide specific to 

35 Chlamydiae, are also preferred. 
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According to the invention, the vectors comprise the elements necessary to allow the 
expression and/or the secretion of the said nucleotide sequences in a given host cell, and also form 
part of the invention. 

The vector should, in this case, comprise a promoter, signals for initiation and for 
5 termination of translation, as well as appropriate regions for regulation of transcription. It should be 
capable of being stably maintained in the host cell and may optionally possess particular signals 
specifying the secretion of the translated protein. These different elements are chosen according to the 
host cell used. To this effect, the nucleotide sequences according to the invention may be inserted into 
autonomously-replicating vectors within the chosen host, or integrative vectors in the chosen host. 

10 Any of the standard methods known to those skilled in the art for the insertion of 

DNA fragments into a vector may be used to construct expression vectors containing a chimeric gene 
consisting of appropriate transcriptional/translational control signals and the protein coding 
sequences. These methods may include in vitro recombinant DNA and synthetic techniques and in 
vivo recombinants (genetic recombination). 

15 Expression of a polypeptide, peptide or derivative, or analogs thereof encoded by a 

polynucleotide sequence in SEQ ID No. 1 or ORFs contained within SEQ ID No. 1 may be regulated 
by a second nucleic acid sequence so that the protein or peptide is expressed in a host transformed 
with the recombinant DNA molecule. For example, expression of a protein or peptide may be 
controlled by any promoter/enhancer element known in the art. Promoters which may be used to 

20 control expression include, but are not limited to, the CMV promoter, the SV40 early promoter region 
(Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained in the 3' long terminal 
repeat of Rous sarcoma virus (Yamamoto, et al., 1980, Cell 22:787-797), the herpes thymidine kinase 
promoter (Wagner et al, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences 
of the metallothionein gene (Brinster et al, 1982, Nature 296:39-42); prokaryotic expression vectors 

25 such as the ^-lactamase promoter (Villa-Kamaroff, et al, 1978, Proc. Natl. Acad. Sci. U.SA. 
75:3727-3731), or the tac promoter (DeBoer, et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:21-25); see 
also "Useful proteins from recombinant bacteria" in Scientific American, 1980, 242:74-94; plant 
expression vectors comprising the nopaline synthetase promoter region (Herrera-Estrella et al, 1983, 
Nature 303:209-213) or the cauliflower mosaic virus 35S RNA promoter (Gardner, et al, 1981, Nucl. 

30 Acids Res. 9:287 1 ), and the promoter of the photosynthetic enzyme ribulose biphosphate carboxylase 
(Herrera-Estrella et al, 1984, Nature 310:1 15-120); promoter elements from yeast or other fungi such 
as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter, and the following animal transcriptional control regions, 
which exhibit tissue specificity and have been utilized in transgenic animals: elastase I gene control 

35 region which is active in pancreatic acinar cells (Swift et al, 1984, Cell 38:639-646; Ornitz et al, 
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1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987, Hepatology 7:425- 
515); insulin gene control region which is active in pancreatic beta cells (Hanahan, 1985, Nature 
315:115-122), immunoglobulin gene control region which is active in lymphoid cells (Grosschedl et 
al, 1984, Cell 38:647-658; Adames et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. 
5 Cell. Biol. 7:1436-1444), mouse mammary tumor virus control region which is active in testicular, 
breast, lymphoid and mast cells (Leder et al, 1986, Cell 45:485-495), albumin gene control region 
which is active in liver (Pinkert et al, 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene 
control region which is active in liver (Krumlauf et al, 1985, Mol. Cell. Biol. 5:1639-1648; Hammer 
et al, 1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is active in the liver 

10 (Kelsey et al, 1987, Genes and Devel. 1:161-171), beta-globin gene control region which is active in 
myeloid cells (Mogram et al, 1985, Nature 315:338-340; Kollias et al, 1986, Cell 46:89-94; myelin 
basic protein gene control region which is active in oligodendrocyte cells in the brain (Readhead et 
al, 1987, Cell 48:703-712); myosin light chain-2 gene control region which is active in skeletal 
muscle (Sani, 1985, Nature 314:283-286), and gonadotropic releasing hormone gene control region 

1 5 which is active in the hypothalamus (Mason et al, 1986, Science 234:1372-1378). 

The vectors according to the invention are, for example, vectors of plasmid or viral 
origin. In a specific embodiment, a vector is used that comprises a promoter operably linked to a 
protein or peptide-encoding nucleic acid sequence in SEQ ID No. 1, or ORFs contained within SEQ 
ID No. 1, one or more origins of replication, and, optionally, one or more selectable markers {e.g., an 

20 antibiotic resistance gene). Expression vectors comprise regulatory sequences that control gene 
expression, including gene expression in a desired host cell. Preferred vectors for the expression of 
the polypeptides of the invention include the pET-type plasmid vectors (Promega) or pBAD plasmid 
vectors (Invitrogen). Furthermore, the vectors according to the invention are useful for transforming 
host cells so as to clone or express the nucleotide sequences of the invention. 

25 Expression can also be achieved using targeted homologous recombination to activate 

Chlamydia trachomatis genes present in the cloned genomic DNA. A heterologous regulatory 
element may be inserted into a stable cell line or cloned microorganism, such that it is operatively 
linked with an endogenous Chlamydia trachomatis gene present in the cloned genome, using 
techniques, such as targeted homologous recombination, which are well known to those of skill in the 

30 art {See, e^, Chappel, U.S. Patent No. 4,215,051 and Skoultchi, WO 91/06667 each of which is 
incorporated herein in its entirety). 

Expression vector/host cell systems containing inserts of polynucleotide sequences in 
SEQ ID No. 1 or ORFs within SEQ ID No. 1, which encode polypeptides, peptides or derivatives, or 
analogs thereof, can be identified by three general approaches: (a) nucleic acid hybridization, (b) 

35 presence or absence of "marker" gene functions, and (c) expression of inserted sequences. In the first 
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approach, the presence of a polynucleotide sequence inserted in an expression vector can be detected 
by nucleic acid hybridization using probes comprising sequences that are homologous to an inserted 
polynucleotide sequence. In the second approach, the recombinant vector/host system can be 
identified and selected based upon the presence or absence of certain "marker" gene functions (e.g., 
5 thymidine kinase activity, resistance to antibiotics, transformation phenotype, occlusion body 
formation in baculovirus, etc.) caused by the insertion of a polynucleotide sequence in the vector. For 
example, if the polynucleotide sequence in SEQ ID No. 1 or ORFs within SEQ ID No. 1 is inserted 
within the marker gene sequence of the vector, recombinants containing the insert can be identified by 
the absence of the marker gene function. In the third approach, recombinant expression vectors can 

10 be identified by assaying the product of the polynucleotide sequence expressed by the recombinant. 
Such assays can be based, for example, on the physical or functional properties of the expressed 
polypeptide in in vitro assay systems, e.g., binding with antibody, promotion of cell proliferation. 

Once a particular recombinant DNA molecule is identified and isolated, several 
methods known in the art may be used to propagate it. The clones identified may be introduced into 

15 an appropriate host cell by standard methods, such as for example lipofection, electroporation, and 
heat shock. Once a suitable host system and growth conditions are established, recombinant 
expression vectors can be propagated and prepared in quantity. 

The invention also encompasses the host cells transformed by a vector according to 
the invention. These cells may be obtained by introducing into host cells a nucleotide sequence 

20 inserted into a vector as defined above, and then culturing the said cells under conditions allowing the 
replication and/or the expression of the transfected nucleotide sequence. 

The host cell may be chosen from eukaryotic or prokaryotic systems, such as for 
example bacterial cells (Olins and Lee, 1993), but also yeast cells (Buckholz, 1993), as well as animal 
cells, in particular cultures of mammalian cells (Edwards and Aruffo, 1993), and in particular Chinese 

25 hamster ovary (CHO) cells, but also insect cells in which methods using baculoviruses for example 
may be used (Luckow, 1993). 

Furthermore, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion desired. 
Expression from certain promoters can be elevated in the presence of certain inducers; thus, 

30 expression of the genetically engineered polypeptide may be controlled. Furthermore, different host 
cells have characteristic and specific mechanisms for the translation^ and post-translational 
processing and modification (e.g., glycosylation, phosphorylation) of proteins. Appropriate cell lines 
or host systems can be chosen to ensure the desired modification and processing of the foreign protein 
expressed. For example, expression in a bacterial system can be used to produce an unglycosylated 

35 core protein product. Expression in yeast will produce a glycosylated product. Expression in 
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mammalian cells can be used to ensure "native" glycosylation of a heterologous protein. Furthermore, 
different vector/host expression systems may effect processing reactions to different extents. 

A preferred host cell for the expression of the proteins of the invention consists of 
prokaryotic cells, such as Gram negative bacteria. 

A further preferred host cell according to the invention is a bacterium belonging to 
the Chlamydia family, more preferably belonging to the species Chlamydia trachomatis or chosen 
from a microorganism associated with the species Chlamydia trachomatis. 

In other specific embodiments, the polypeptides, peptides or derivatives, or analogs 
thereof may be expressed as a fusion, or chimeric protein product (comprising the protein, fragment, 
analog, or derivative joined via a peptide bond to a heterologous protein sequence (of a different 
protein)). Such a chimeric product can be made by ligating the appropriate nucleic acid sequences 
encoding the desired amino acid sequences to each other by methods known in the art, in the proper 
coding frame, and expressing the chimeric product by methods commonly known in the art. 
Alternatively, such a chimeric product may be made by protein synthetic techniques, e.g., by use of a 
peptide synthesizer. 

Genomic sequences can be cloned and expressed as translational gene products (le,, 
peptides, polypeptides, and proteins) or transcriptional gene products (Le,, antisense and ribozymes). 

The invention further relates to the intracellular production of an antisense nucleic 
acid sequence of SEQ ID No. 1 by transcription from an exogenous sequence. For example, a vector 
can be introduced in vivo such that it is taken up by a cell, within which cell the vector or a portion 
thereof is transcribed, producing an antisense nucleic acid (RNA) of the invention. Such a vector 
would contain a sequence encoding an antisense nucleic acid. Such a vector can remain episomal or 
become chromosomally integrated, as long as it can be transcribed to produce the desired antisense 
RNA. Such vectors can be constructed by recombinant DNA technology methods standard in the art. 
Vectors can be plasmid, viral, or others known in the art, used for replication and expression in 
mammalian cells. Expression of the sequence encoding the antisense RNA can be by any promoter 
known in the art to act in mammalian, preferably human, cells. Such promoters can be inducible or 
constitutive. Such promoters include but are not limited to: the CMV promoter, the SV40 early 
promoter region (Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained in the 
3n long terminal repeat of Rous sarcoma virus (Yamamoto et al, 1980, Cell 22:787-797), the herpes 
thymidine kinase promoter (Wagner et al, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the 
regulatory sequences of the metallothionein gene (Brinster et al, 1982, Nature 296:39-42), etc. 

In a specific embodiment, the antisense oligonucleotide comprises catalytic RNA, or 
a ribozyme (see, e.g., PCT International Publication WO 90/1 1364, published October 4, 1990; Sarver 
et al., 1990, Science 247:1222-1225). In another embodiment, the oligonucleotide is a 2n-0- 
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methylribonucleotide (Inoue et al, 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
analog (Inoue et al, 1987, FEBS Lett. 215:327-330). 

In another embodiment, the antisense nucleic acids of the invention comprise a 
sequence complementary to at least a portion of an RNA transcript of a polynucleotide sequence in 
5 SEQ ID No. 1 . However, absolute complementarity, although preferred, is not required. A sequence 
"complementary to at least a portion of an RNA," as referred to herein, means a sequence having 
sufficient complementarity to be able to hybridize with the RNA, forming a stable duplex; in the case 
of double-stranded antisense nucleic acid sequence, a single strand of the duplex DNA may thus be 
tested, or triplex formation may be assayed. The ability to hybridize will depend on both the degree 
10 of complementarity and the length of the antisense nucleic acid. Generally, the longer the hybridizing 
nucleic acid, the more base mismatches with an RNA transcribed from SEQ ID No. 1 may contain and 
still form a stable duplex (or triplex, as the case may be). One skilled in the art can ascertain a 
tolerable degree of mismatch by use of standard procedures to determine the melting point of the 
hybridized complex. 

15 The invention also relates to the animals, except humans, comprising one of the 

above-described transformed cells according to the invention. 

The production of transgenic animals according to the invention overexpressing one 
or more of the Chlamydia trachomatis genes will be preferably carried out on rats, mice or rabbits 
according to methods well known to persons skilled in the art such as viral or non viral transfections. 

20 The transgenic animals overexpressing one or more of the said genes may be obtained by transfection 
of multiple copies of the said genes under the control of a powerful promoter of a ubiquitous nature, 
or which is selective for one type of tissue. The transgenic animals may also be obtained by 
homologous recombination on embryonic stem cells, transfer of these stem cells to embryos, selection 
of the chimeras affected at the level of the reproductive lines, and growth of the said chimeras. 

25 The transformed cells as well as the transgenic animals according to the invention can 

be used in methods of preparing the recombinant polypeptide. 

It is now possible to produce recombinant polypeptides in a relatively large quantity 
by genetic engineering using the cells transformed with expression vectors according to the invention 
or using transgenic animals according to the invention. 

30 The methods of preparing a polypeptide of the invention in recombinant form, 

characterized in that they use a vector and/or a cell transformed with a vector according to the 
invention and/or a transgenic animal comprising one of the said transformed cells according to the 
invention, are themselves included in the present invention. 

Among the said methods of preparing a polypeptide of the invention in recombinant 

35 form, the methods of preparation using a vector, and/or a cell transformed with the said vector and/or 
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a transgenic animal comprising one of the said transformed cells, containing a nucleotide sequence 
encoding a polypeptide of the cellular envelope of Chlamydia trachomatis or one of its representative 
fragments, more preferably encoding a polypeptide of the outer cellular envelope of Chlamydia 
trachomatis or one of its fragment, are preferred. 
5 Among the said methods of preparing a polypeptide of the invention in recombinant 

form, the methods of preparation using a vector, and/or a cell transformed with the said vector and/or 
a transgenic animal comprising one of the said transformed cells, containing a nucleotide sequence 
encoding a Chlamydia trachomatis secreted polypeptide or one of its representative fragments, or 
encoding a transport polypeptide, a surface exposed polypeptide, a lipoprotein or one of its 

10 representative fragments, a polypeptide involved in lipopolysaccharide biosynthesis, a Type III or 
other secreted polypeptide, a polypeptide containing RGD attachment sites, a cell wall anchored 
surface polypeptide, a polypeptide not found in Chlamydia pneumoniae, a ribosomal polypeptide or a 
polypeptide involved in secretion, transcription, translation, maturation of proteins, a polypeptide 
involved in the synthesis of the wall, a polypeptide involved in the virulence, a polypeptide involved 

15 in the intermediate metabolism, in particular in the metabolism of sugars and/or of cofactors, a 
polypeptide involved in the metabolism of nucleotides, of amino acids, of nucleic acids or of fatty 
acids of Chlamydia trachomatis or one of their representative fragments, or a polypeptide specific to 
Chlamydiae, are also preferred. 

The recombinant polypeptides obtained as indicated above may be provided either in 

20 glycosylated or nonglycosylated form and may or may not have the natural tertiary structure. 

A preferred variant consists in producing a recombinant polypeptide fused to a 
«carrier» protein (chimeric protein). The advantage of this system is that it allows a stabilization and 
a reduction in proteolysis of the recombinant product, an increase in solubility during renaturation 
in vitro and/or a simplification of purification when the fusion partner has affinity for a specific 

25 ligand. 

More particularly, the invention relates to a method of preparing a polypeptide of the 
invention comprising the following steps: 

a) culture of the transformed cells under conditions allowing the expression of a recombinant 
polypeptide having a nucleic acid sequence according to the invention; 
30 b) where appropriate, recovery of the said recombinant polypeptide. 

When the method of preparing a polypeptide of the invention uses a transgenic 
animal according to the invention, the recombinant polypeptide is then extracted from the said animal. 

The subject of the invention is also a polypeptide capable of being obtained by a 
method of the invention as described above. 
35 Tr »e invention also comprises a method of preparing a synthetic polypeptide, 
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characterized in that it uses an amino acid sequence of polypeptides according to the invention. 

The invention also relates to a synthetic polypeptide obtained by a method according 

to the invention. 

Polypeptides according to the invention may also be prepared by conventional 
5 techniques in the field of peptide synthesis under conditions suitable to produce the polypeptides 
encoded by the polynucleotide of the invention. This synthesis may be carried out in and recovered 
from a homogeneous solution or on a solid phase. 

For example, the synthesis technique in a homogeneous solution described by 
Houbenweyl in 1974 may be used. 
10 This method of synthesis consists in successively condensing, in pairs, the successive 

amino acids in the required order, or in condensing amino acids and fragments previously formed and 
already containing several amino acids in the appropriate order, or alternatively several fragments 
thus previously prepared, it being understood that care will have been taken to protect beforehand all 
the reactive functional groups carried by these amino acids or fragments, with the exception of the 
15 amine functional groups of one and the carboxyl functional groups of the other or vice versa, which 
should normally take part in the formation of the peptide bonds, in particular after activation of the 
carboxyl functional group, according to methods well known in peptide synthesis. 

According to another preferred technique of the invention, the one described by 
Merrifield is used. 

20 To manufacture a peptide chain according to the Merrifield method, a highly porous 

polymer resin is used, onto which the first C-terminal amino acid of the chain is attached. This amino 
acid is attached onto a resin via its carboxyl group and its amine functional group is protected. The 
amino acids which will constitute the peptide chain are thus attached, one after another, onto the 
amine group, each time deprotected beforehand, of the portion of the peptide chain already formed, 

25 and which is attached to the resin. When the entire peptide chain desired is formed, the protecting 
groups are removed from the various amino acids constituting the peptide chain and the peptide is 
detached from the resin with the aid of an acid. 

The invention relates, in addition, to hybrid (fusion) polypeptides having at least one 
polypeptide or one of its representative fragments according to the invention, and a sequence of a 

30 polypeptide capable of eliciting an immune response in humans or animals. 

Advantageously, the antigenic determinant is such that it is capable of eliciting a 
humoral and/or cellular response. 

An antigenic determinant may be identified by screening expression libraries of the 
Chlamydia trachomatis genome with antibodies contained in the serum of patients infected with a 

35 bacterium belonging to the species Chlamydia trachomatis. An antigenic determinant may comprise a 



WO 99/28475 PCT/IB98/01939 
63 

polypeptide or one of its fragments according to the invention, in glycosylated form, used in order to 
obtain immunogenic compositions capable of inducing the synthesis of antibodies directed against 
multiple epitopes. The said polypeptides or their glycosylated fragments also form part of the 
invention. 

5 These hybrid molecules may consist, in part, of a carrier molecule for polypeptides or 

for their representative fragments according to the invention, combined with a portion which may be 
immunogenic, in particular an epitope of the diphtheria toxin, the tetanus toxin, a hepatitis B virus 
surface antigen (patent FR 79 21811), the poliomyelitis virus VP1 antigen or any other viral or 
bacterial toxin or antigen. 

10 The methods of synthesizing the hybrid molecules include the methods used in 

genetic engineering to construct hybrid nucleotide sequences encoding the desired polypeptide 
sequences. Reference may be advantageously made, for example, to the technique for producing 
genes encoding fusion proteins described by Minton in 1984. 

The said hybrid nucleotide sequences encoding a hybrid polypeptide as well as the 

15 hybrid polypeptides according to the invention, characterized in that they are recombinant 
polypeptides obtained by the expression of the said hybrid nucleotide sequences, also form part of the 
invention. 

The invention also comprises the vectors characterized in that they contain one of the 
said hybrid nucleotide sequences. The host cells transformed by the said vectors, the transgenic 

20 animals comprising one of the said transformed cells as well as the methods of preparing recombinant 
polypeptides using the said vectors, the said transformed cells and/or the said transgenic animals of 
course also form part of the invention. 

The polypeptides according to the invention, the antibodies according to the invention 
described below and the nucleotide sequences according to the invention may advantageously be used 

25 in in vitro and/or in vivo methods for the detection and/or the identification of bacteria belonging to 
the species Chlamydia trachomatis, in a biological sample (biological tissue or fluid) which is likely 
to contain them. These methods, depending on the specificity of the polypeptides, of the antibodies 
and of the nucleotide sequences according to the invention which will be used, may in particular 
detect and/or identify the bacterial variants belonging to the species Chlamydia trachomatis as well as 

30 the associated microorganisms capable of being detected by the polypeptides, the antibodies and the 
nucleotide sequences according to the invention which will be chosen. It may, for example, be 
advantageous to choose a polypeptide, an antibody or a nucleotide sequence according to the 
invention, which is capable of detecting any bacterium of the Chlamydia family by choosing a 
polypeptide, an antibody and/or a nucleotide sequence according to the invention which is specific to 

35 the family or, on the contrary, it will be most particularly advantageous to target a variant of the 



WO 99/28475 PCT/IB98/01939 
64 

species Chlamydia trachomatis, which is responsible, for example, for the induction or the worsening 
of pathologies specific to the targeted variant, by choosing a polypeptide, an antibody and/or a 
nucleotide sequence according to the invention which is specific to the said variant. 

The polypeptides according to the invention may advantageously be used in a method 
5 for the detection and/or the identification of bacteria belonging to the species Chlamydia trachomatis 
or to an associated microorganism, in a biological sample (biological tissue or fluid) which is likely to 
contain them, characterized in that it comprises the following steps: 

a) bringing this biological sample into contact with a polypeptide or one of its representative 
fragments according to the invention (under conditions allowing an immunological reaction between 

10 the said polypeptide and the antibodies which may be present in the biological sample); 

b) detecting the antigen-antibody complexes which may be formed. 

Preferably, the biological sample consists of a fluid, for example a human or animal 
serum, blood or biopsies. 

Any conventional procedure may be used to carry out such a detection of the antigen- 
15 antibody complexes which may be formed. 

By way of example, a preferred method uses immunoenzymatic procedures based on 
the ELISA technique, immunofluorescence procedures or radioimmunological procedures (RIA), and 
the like. 

Accordingly, the invention also relates to the polypeptides according to the invention, 
20 labelled with the aid of a suitable label such as a label of the enzymatic, fluorescent or radioactive 
type. 

Such methods comprise, for example, the following steps: 
deposition of defined quantities of a polypeptide composition according to the invention into 
the wells of a microtitre plate, 
25 - introduction, into the said wells, of increasing dilutions of serum, or of a different biological 
sample as defined above, which has to be analysed, 
incubation of the microplate, 

introduction, into the wells of the microtitre plate, of labelled antibodies directed against 
human or animal immunoglobulins, these antibodies having been labelled with the aid of an enzyme 
30 selected from those which are capable of hydrolyzing a substrate, thereby modifying the absorption of 
the radiation of the latter, at least at a defined wavelength, for example at 550 nm, 

detection, by comparison with a control, of the quantity of substrate hydrolyzed. 

The invention also relates to a kit or set for the detection and/or the identification of 
bacteria belonging to the species Chlamydia trachomatis or to an associated microorganism, 
35 characterized in that it comprises the following components: 
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a polypeptide according to the invention, 

where appropriate, the reagents for constituting the medium appropriate for the 
immunological or specific reaction, 

the reagents allowing the detection of the antigen-antibody complexes produced by the 
5 immunological reaction between the polypeptide(s) of the invention and the antibodies which may be 
present in the biological sample, it being possible for these reagents also to carry a label, or to be 
capable of being recognized in turn by a labelled reagent, more particularly in the case where the 
polypeptide according to the invention is not labelled, 

where appropriate, a reference biological sample (negative control) free of antibodies 
1 0 recognized by a polypeptide according to the invention, 

where appropriate, a reference biological sample (positive control) containing a 
predetermined quantity of antibodies recognized by a polypeptide according to the invention. 

According to the invention, the polypeptides, peptides, fusion proteins or other 
derivatives, or analogs thereof encoded by a polynucleotide sequence in SEQ ID No. 1 , may be used 
15 as an immunogen to generate antibodies which immunospecifically bind such an immunogen. Such 
antibodies may include, but are not limited to, polyclonal and monoclonal antibodies, humanized or 
chimeric antibodies, single chain antibodies, Fab fragments, F(ab') 2 fragments, fragments produced by 
a Fab expression library, anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of 
the above. In a specific embodiment, the antibody to a polypeptide, peptide or other derivative, or 
20 analog thereof encoded by a polynucleotide sequence in SEQ ID No. 1 is a bispecific antibody (see 
generally, e.g. Fanger and Drakeman, 1995, Drug News and Perspectives 8: 133-137). Such a 
bispecific antibody is genetically engineered to recognize both (1) an epitope and (2) one of a variety 
of "trigger" molecules, e.g. Fc receptors on myeloid cells, and CD3 and CD2 on T cells, that have 
been identified as being able to cause a cytotoxic T-cell to destroy a particular target. Such bispecific 
25 antibodies can be prepared either by chemical conjugation, hybridoma, or recombinant molecular 
biology techniques known to the skilled artisan. 

Various procedures known in the art may be used for the production of polyclonal 
antibodies to a polypeptide, peptide or other derivative, or analog thereof encoded by a polynucleotide 
sequence in SEQ ID No. 1. For the production of antibody, various host animals can be immunized 
30 by injection with a polypeptide, or peptide or other derivative, or analog thereof, including but not 
limited to rabbits, mice, rats, etc. Various adjuvants, depending on the host species, may be used to 
increase the immunological response, including but not limited to Stimulon™ QS-21 (Aquila 
Biopharmaceuticals, Inc., Framingham, MA), MPL™ (3-O-deacylated monophosphoryl lipid A; RIBI 
ImmunoChem Research, Inc., Hamilton, MT), aluminum phosphate, IL- 12 (Genetics Institute, 
35 Cambridge, MA), Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, 
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surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanins, dinitrophenol, BCG (bacille Calmette-Guerin), and corynebacterium 
parvum. Alternatively, polyclonal antibodies may be prepared by purifying, on an affinity column 
onto which a polypeptide according to the invention has been previously attached, the antibodies 
5 contained in the serum of patients infected with a bacterium belonging to the species Chlamydia 
trachomatis. 

For preparation of monoclonal antibodies directed toward a polypeptide, peptide or 
other derivative, or analog, any technique which provides for the production of antibody molecules by 
continuous cell lines in culture may be used. For example, the hybridoma technique originally 

10 developed by Kohler and Milstein (1975, Nature 256:495-497), as well as the trioma technique, the 
human B-cell hybridoma technique (Kozbor et al, 1983, Immunology Today 4:72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole et al, 1985, in Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). In an additional embodiment of the 
invention, monoclonal antibodies can be produced in germ-free animals utilizing technology 

15 described in PCT/US 90/02545. In another embodiment of the invention, transgenic non-human 
animals can be used for the production of human antibodies utilizing technology described in WO 
98/24893 and WO 96/33735. According to the invention, human antibodies may be used and can be 
obtained by using human hybridomas (Cote et al, 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030) 
or by transforming human B cells with EBV virus in vitro (Cole et al, 1985, in Monoclonal 

20 Antibodies and Cancer Therapy. Alan R. Liss, pp. 77-96). In fact, according to the invention, 
techniques developed for the production of «chimeric antibodies», (Morrison et al, 1984, PROC. 
NATL. ACAD. SCI. U.S.A. 81:6851-6855; Neuberger et al, 1984, Nature 312:604-608: Takeda et 
al, 1985, Nature 314:452-454) by splicing the genes from a mouse antibody molecule specific for a 
polypeptide, peptide or other derivative, or analog together with genes from a human antibody 

25 molecule of appropriate biological activity can be used; such antibodies are within the scope of this 
invention. 

According to the invention, techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce polypeptide or peptide-specific single 
chain antibodies. An additional embodiment of the invention utilizes the techniques described for the 
30 construction of Fab expression libraries (Huse et al, 1989, Science 246:1275-1281) to allow rapid 
and easy identification of monoclonal Fab fragments with the desired specificity for polypeptides, 
derivatives, or analogs. 

Antibody fragments which contain the idiotype of the molecule can be generated by 
known techniques. For example, such fragments include but are not limited to: the F(ab') 2 fragment 
35 which can be produced by pepsin digestion of the antibody molecule; the Fab' fragments which can be 
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generated by reducing the disulfide bridges of the F(ab') 2 fragment, the Fab fragments which can be 
generated by treating the antibody molecule with papain and a reducing agent, and Fv fragments. 

In addition, techniques have been developed for the production of chimerized {See, e^g., 
Boss, M. et al., U.S. Patent No. 4,816,397; and Cabilly, S. et al., U.S. Patent No. 5,585,089 each of 
5 which is incorporated herein by reference in its entirety) humanized antibodies {See, e^g,, Queen, U.S. 
Patent No. 5,585,089, which is incorporated herein by reference in its entirety.) An immunoglobulin 
light or heavy chain variable region consists of a "framework" region interrupted by three 
hypervariable regions, referred to as complementarily determining regions (CDRs). The extent of the 
framework region and CDRs have been precisely defined (See, "Sequences of Proteins of 

10 Immunological Interest", Kabat, E. et al., U.S. Department of Health and Human Services (1983)). 
Briefly, humanized antibodies are antibody molecules from non-human species having one or more 
CDRs from the non-human species and a framework from a human immunoglobulin molecule. 

The antibodies of the invention may also be labelled in the same manner as described 
above for the nucleic probes of the invention such as an enzymatic, fluorescent or radioactive type 

15 labelling. 

The invention relates, in addition, to a method for the detection and/or the 
identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
microorganism in a biological sample, characterized in that it comprises the following steps: 

a) bringing the biological sample (biological tissue or fluid) into contact with a mono- or 
20 polyclonal antibody according to the invention (under conditions allowing an immunological reaction 

between the said antibodies and the polypeptides of the bacterium belonging to the species Chlamydia 
trachomatis or to an associated microorganism which may be present in the biological sample, that is, 
under conditions suitable for the formation of immune complexes); 

b) detecting the antigen-antibody complex which may be formed. 

25 Also falling within the scope of the invention is a kit or set for the detection and/or 

the identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
microorganism, characterized in that it comprises the following components: 

a polyclonal or monoclonal antibody according to the invention, labelled where appropriate; 
where appropriate, a reagent for constituting the medium appropriate for carrying out the 
30 immunological reaction; 

a reagent allowing the detection of the antigen-antibody complexes produced by the 
immunological reaction, it being possible for this reagent also to carry a label, or to be capable of 
being recognized in turn by a labelled reagent, more particularly in the case where the said 
monoclonal or polyclonal antibody is not labelled; 
35 - where appropriate, reagents for carrying out the lysis of the cells in the sample tested. 
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The principle of the DNA chip which was explained above may also be used to 
produce protein «chips» on which the support has been coated with a polypeptide or an antibody 
according to the invention, or arrays thereof, in place of the DNA. These protein «chips» make it 
possible, for example, to analyse the biomolecular interactions (BIA) induced by the affinity capture 
5 of target analytes onto a support coated, for example, with proteins, by surface plasma resonance 
(SPR). Reference may be made, for example, to the techniques for coupling proteins onto a solid 
support which are described in EP 524 800 or to the methods describing the use of biosensor-type 
protein chips such as the BIAcore-type technique (Pharmacia) (Arlinghaus et al., 1997, Krone et al„ 
1997, Chatelier et al., 1995). These polypeptides or antibodies according to the invention, capable of 

10 specifically binding antibodies or polypeptides derived from the sample to be analysed, may thus be 
used in protein chips for the detection and/or the identification of proteins in samples. The said 
protein chips may in particular be used for infectious diagnosis and may preferably contain, per chip, 
several polypeptides and/or antibodies of the invention of different specificity, and/or polypeptides 
and/or antibodies capable of recognizing microorganisms different from Chlamydia trachomatis. 

15 Accordingly, the subject of the present invention is also the polypeptides and the 

antibodies according to the invention, characterized in that they are immobilized on a support, in 
particular of a protein chip. 

The protein chips, characterized in that they contain al least one polypeptide or one 
antibody according to the invention immobilized on the support of the said chip, also form part of the 

20 invention. 

The invention comprises, in addition, a protein chip according to the invention, 
characterized in that it contains, in addition, at least one polypeptide of a microorganism different 
from Chlamydia trachomatis or at least one antibody directed against a compound of a microorganism 
different from Chlamydia trachomatis, immobilized on the support of the said chip. 

25 The invention also relates to a kit or set for the detection and/or the identification of 

bacteria belonging to the species Chlamydia trachomatis or to an associated microorganism, or for the 
detection and/or the identification of a microorganism characterized in that it comprises a protein chip 
according to the invention. 

The subject of the present invention is also a method for the detection and/or the 

30 identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
microorganism in a biological sample, characterized in that it uses a nucleotide sequence according to 
the invention. 

More particularly, the invention relates to a method for the detection and/or the 
identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
35 microorganism in a biological sample, characterized in that it comprises the following steps: 
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a) where appropriate, isolation of the DNA from the biological sample to be analysed, or 
optionally production of a cDNA from the RNA in the biological sample; 

b) specific amplification of the DNA of bacteria belonging to the species Chlamydia trachomatis 
or to an associated microorganism with the aid of at least one primer according to the invention; 

5 c) detection of the amplification products. 

These may be detected, for example, by the molecular hybridization technique using a 
nucleic probe according to the invention. This probe will be advantageously labelled with a 
nonradioactive (cold probe) or radioactive element. 

For the purposes of the present invention, «DNA in the biological sample» or «DNA 
10 contained in the biological sample» will be understood to mean either the DNA present in the 
biological sample considered, or optionally the cDNA obtained after the action of a reverse 
transcriptase-type enzyme on the RNA present in the said biological sample. 

Another aim of the present invention consists in a method according to the invention, 
characterized in that it comprises the following steps: 
15 a) bringing a nucleotide probe according to the invention into contact with a biological sample, 
the DNA contained in the biological sample having, where appropriate, been previously made 
accessible to hybridization, under conditions allowing the hybridization of the probe to 
complementary base pairs of the DNA of a bacterium belonging to the species Chlamydia trachomatis 
or to an associated microorganism; 
20 b) detecting the hybridization complex formed between the nucleotide probe and the DNA in the 
biological sample. 

The present invention also relates to a method according to the invention, 
characterized in that it comprises the following steps: 

a) bringing a nucleotide probe immobilized on a support according to the invention into contact 
25 with a biological sample, the DNA in the sample having, where appropriate, been previously made 

accessible to hybridization, under conditions allowing the hybridization of the probe to the DNA of a 
bacterium belonging to the species Chlamydia trachomatis or to an associated microorganism; 

b) bringing the hybrid formed between the nucleotide probe immobilized on a support and the 
DNA contained in the biological sample, where appropriate after removal of the DNA in the 

30 biological sample which has not hybridized with the probe, into contact with a labelled nucleotide 
probe according to the invention; 

c) detecting the new hybrid formed in step b). 

According to an advantageous embodiment of the method for the detection and/or the 
identification defined above, it is characterized in that, prior to step a), the DNA in the biological 
35 sample is primer-extended and/or amplified beforehand with the aid of at least one primer according 
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to the invention. 

The invention relates, in addition, to a kit or set for the detection and/or the 
identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
microorganism, characterized in that it comprises the following components: 
5 a) a nucleotide probe according to the invention; 

b) where appropriate, the reagents necessary for carrying out a hybridization reaction; 

c) where appropriate, at least one primer according to the invention as well as the reagents (e.g., 
polymerase and/or deoxynucleotide triphosphates) necessary for a DNA amplification reaction. 

The invention also relates to a kit or set for the detection and/or the identification of 
10 bacteria belonging to the species Chlamydia trachomatis or to an associated microorganism, 
characterized in that it comprises the following components: 

a) a nucleotide probe, called capture probe, according to the invention; 

b) an oligonucleotide probe, called detection probe, according to the invention; 

c) where appropriate, at least one primer according to the invention as well as the reagents (e.g., 
1 5 polymerase and/or deoxynucleotide triphosphates) necessary for a DNA amplification reaction. 

The invention also relates to a kit or set for the detection and/or the identification of 
bacteria belonging to the species Chlamydia trachomatis or to an associated microorganism, 
characterized in that it comprises the following components: 
a) at least one primer according to the invention; 

20 b) where appropriate, the reagents necessary for carrying out a DNA amplification reaction; 

c) where appropriate, a component which makes it possible to check the sequence of the 
amplified fragment, more particularly an oligonucleotide probe according to the invention. 

The invention relates, in addition, to a kit or set for the detection and/or the 
identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 

25 microorganism, or for the detection and/or the identification of a microorganism characterized in that 
it comprises a DNA chip according to the invention. 

The invention also relates to a method or to a kit or set according to the invention for 
the detection and/or the identification of bacteria belonging to the species Chlamydia trachomatis, 
characterized in that the said primer and/or the said probe according to the invention are chosen from 

30 the nucleotide sequences specific to the species Chlamydia trachomatis, in that the said polypeptides 
according to the invention are chosen from the polypeptides specific to the species Chlamydia 
trachomatis and in that the said antibodies according to the invention are chosen from the antibodies 
directed against the polypeptides according to the invention chosen from the polypeptides specific to 
the species Chlamydia trachomatis. 

35 Preferably, the said method or the said kit or set above according to the invention, for 
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the detection and/or the identification of bacteria belonging to the species Chlamydia trachomatis is 
characterized in that the said primer and/or the said probe or the said polypeptides are chosen from the 
nucleotide sequences or polypeptides according to the invention which have been identified as being 
specific to the species Chlamydia trachomatis and in that the said antibodies according to the 
5 invention are chosen from the antibodies directed against the polypeptides according to the invention 
chosen from the polypeptides identified as being specific to the species Chlamydia trachomatis. 

The invention relates, in addition, to a method or a kit or set according to the 
invention for the diagnosis of predispositions to, or of a condition caused by, genital diseases which 
are induced or worsened by a Chlamydia trachomatis infection. 
1 0 The invention also relates to a method or a kit or set according to the invention for the 

diagnosis of predispositions to, or of conditions caused by, eye diseases induced or worsened by a 
Chlamydia trachomatis infection. 

The invention also relates to a method or a kit or set according to the invention for the 
diagnosis of predispositions to, or of conditions caused by, systemic diseases, in particular of the 
15 lymphatic system, which are induced or worsened by a Chlamydia trachomatis infection. 

According to another aspect, the subject of the invention is the use of polypeptides 
according to the invention, of cells transformed with a vector according to the invention and/or of 
transformed animals according to the invention, for the biosynthesis or the biodegradation of organic 
or inorganic compounds. 

20 As has been mentioned above, the nucleotide sequences of the invention were 

identified by homology with sequences known to encode, for example, polypeptides or fragments of 
enzymatic polypeptides involved in the biosynthesis or the biodegradation of organic or inorganic 
molecules. 

It is thus possible to use the said polypeptides of the invention in a similar manner for 
25 the biosynthesis or the biodegradation of organic or inorganic compounds of industrial or therapeutic 
interest (called compounds of interest). 

Among these polypeptides, there may be mentioned in particular the enzymes 
involved in metabolism, such as the proteolytic enzymes, amino transferases, glucose metabolism, or 
the enzymes which may be used in the biosynthesis of sugars, amino acids, fatty acids, polypeptides, 
30 nucleotides, nucleic acids or any other organic or inorganic compound or in the biodegradation of 
organic or inorganic compounds. 

Among these polypeptides, there may be mentioned, in addition, the mutated or 
modified enzymes corresponding to mutated or modified polypeptides according to the invention 
which may also be used for the biosynthesis or the biodegradation of organic or inorganic compounds 
35 at the industrial level, such as, for example, the production of compounds of interest, the reprocessing 
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of manufacturing residues applied to the food industries, to the papermaking industry or to the 
chemical and pharmaceutical industries. 

The methods of biosynthesis or biodegradation of organic or inorganic compounds, 
characterized in that they use a polypeptide or one of its representative fragments according to the 
5 invention, transformed cells according to the invention and/or a transformed animal according to the 
invention, also form part of the invention. 

The invention relates, in addition, to the use of a nucleotide sequence according to the 
invention, of a polypeptide according to the invention, of an antibody according to the invention, of a 
cell according to the invention, and/or of a transformed animal according to the invention, for the 

10 selection of an organic or inorganic compound capable of modulating, regulating, inducing or 
inhibiting the expression of genes, and/or of modifying the cellular replication of eukaryotic or 
prokaryotic cells or capable of inducing, inhibiting or worsening the pathologies linked to an infection 
by Chlamydia trachomatis or one of its associated microorganisms. 

The invention also comprises screening assays that comprise method of selecting 

15 compounds capable of binding to a polypeptide, fusion polypeptide, or one of its representative 
fragments according to the invention, capable of binding to a nucleotide sequence according to the 
invention, or capable of recognizing an antibody according to the invention, and/or capable of 
modulating, regulating, inducing or inhibiting the expression of genes, and/or of modifying the growth 
or the cellular replication of eukaryotic or prokaryotic cells, or capable of inducing, inhibiting or 

20 worsening, in an animal or human organism, the pathologies linked to an infection by Chlamydia 
trachomatis or one of its associated microorganisms, characterized in that it comprises the following 
steps: 

a) bringing the said compound into contact with the said polypeptide, the said 
nucleotide sequence, with a transformed cell according to the invention and/or administering the said 

25 compound to a transformed animal according to the invention; 

b) determining the capacity of the said compound to bind with the said 
polypeptide or the said nucleotide sequence, or to modulate, regulate, induce or inhibit the expression 
of genes, or to modulate growth or cellular replication, or to induce, inhibit or worsen in the said 
transformed animal, the pathologies linked to an infection by Chlamydia trachomatis or one of its 

30 associated microorganisms. 

The transformed cells and/or animals according to the invention may advantageously 
serve as a model and may be used in methods for studying, identifying and/or selecting compounds 
capable of being responsible for pathologies induced or worsened by Chlamydia trachomatis, or 
capable of preventing and/or of treating these pathologies such as, for example, genital, eye or 

35 systemic diseases, especially of the lymphatic system. In particular, the transformed host cells, in 



WO 99/28475 PCT/IB98/01939 
73 

particular bacteria of the Chlamydia family whose transformation with a vector according to the 
invention may, for example, increase or inhibit its infectivity, or modulate the pathologies usually 
induced or worsened by the infection, may be used to infect animals in which the onset of pathologies 
will be monitored. These nontransformed animals, infected for example with transformed Chlamydia 
5 bacteria, may serve as a study model. In the same manner, the transformed animals according to the 
invention may, for example, exhibit predispositions to genital and/or eye and/or systemic diseases, 
especially of the lymphatic system, and thus be used in methods for selecting compounds capable of 
preventing and/or of treating the said diseases. The said methods using the said transformed cells 
and/or transformed animals form part of the invention. 

10 The compounds capable of being selected may be organic compounds such as 

polypeptides or carbohydrates or any other organic or inorganic compounds already known, or new 
organic compounds produced using molecular modelling techniques and obtained by chemical or 
biochemical synthesis, these techniques being known to persons skilled in the art. 

The said selected compounds may be used to modulate the growth and/or the cellular 

15 replication of Chlamydia trachomatis or any other associated microorganism and thus to control 
infection by these microorganisms. The said compounds according to the invention may also be used 
to modulate the growth and/or the cellular replication of all eukaryotic or prokaryotic cells, in 
particular tumour cells and infectious microorganisms, for which the said compounds will prove 
active, the methods which make it possible to determine the said modulations being well known to 

20 persons skilled in the art. 

Compound capable of modulating the growth of a microorganism is understood to 
designate any compound which makes it possible to act, to modify, to limit and/or to reduce the 
development, the growth, the rate of proliferation and/or the viability of the said microorganism. 

This modulation may be achieved, for example, by an agent capable of binding to a 

25 protein and thus of inhibiting or of potentiating its biological activity, or capable of binding to a 
membrane protein of the outer surface of a microorganism and of blocking the penetration of the said 
microorganism into the host cell or of promoting the action of the immune system of the infected 
organism directed against the said microorganism. This modulation may also be achieved by an agent 
capable of binding to a nucleotide sequence of a DNA or RNA of a microorganism and of blocking, 

30 for example, the expression of a polypeptide whose biological or structural activity is necessary for 
the growth or for the reproduction of the said microorganism. 

Associated microorganism is understood to designate in the present invention any 
microorganism whose gene expression may be modulated, regulated, induced or inhibited, or whose 
growth or cellular replication may also be modulated by a compound of the invention. Associated 

35 microorganism is also understood to designate in the present invention any microorganism containing 
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nucleotide sequences or polypeptides according to the invention. These microorganisms may, in some 
cases, contain polypeptides or nucleotide sequences identical or homologous to those of the invention 
may also be detected and/or identified by the detection and/or identification methods of kit according 
to the invention and may also serve as a target for the compounds of the invention. 
5 The invention relates to the compounds capable of being selected by a method of 

selection according to the invention. 

The invention also relates to a pharmaceutical composition comprising a compound 
chosen from the following compounds: 
a nucleotide sequence according to the invention; 
10 a polypeptide or fusion polypeptide according to the invention; 
a vector according to the invention; 
an antibody according to the invention; and 

a compound capable of being selected by a method of selection according to the invention, optionally 
in combination with a pharmaceutically acceptable vehicle or carrier. 
15 An effective quantity is understood to designate a sufficient quantity of the said 

compound or antibody, or of a polypeptide of the invention, which makes it possible to modulate the 
growth of Chlamydia trachomatis or of an associated microorganism. 

The invention also relates to a pharmaceutical composition according to the invention 
for the prevention or the treatment of an infection by a bacterium belonging to the species Chlamydia 
20 trachomatis or by an associated microorganism. 

The invention relates, in addition, to an immunogenic and/or vaccine composition, 
characterized in that it comprises one or more polypeptides according to the invention and/or one or 
more hybrid polypeptides according to the invention. 

The invention also comprises the use of a transformed cell according to the invention, 
25 for the preparation of a vaccine composition. 

The invention also relates to a vaccine composition, characterized in that it contains a 
nucleotide sequence according to the invention, a vector according to the invention and/or a 
transformed cell according to the invention. 

The invention also relates to the vaccine compositions according to the invention, for 
30 the prevention or the treatment of an infection by a bacterium belonging to the species Chlamydia 
trachomatis or by an associated microorganism. 

The invention also relates to the use of DNA encoding polypeptides of Chlamydia 
trachomatis, in particular antigenic determinants, to be formulated as vaccine compositions. In 
accordance with this aspect of the invention, the DNA of interest is engineered into an expression 
35 vector under the control of regulatory elements, which will promote expression of the DNA, i.e., 
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promoter or enhancer elements. In one preferred embodiment, the promoter element may be cell- 
specific and permit substantial transcription of the DNA only in predetermined cells. The DNA may 
be introduced directly into the host either as naked DNA (U.S. Patent No. 5,679,647 incorporated 
herein by reference in their entirety) or formulated in compositions with other agents which may 
5 facilitate uptake of the DNA including viral vectors, i.e., adenovirus vectors, or agents which facilitate 
immunization, such as bupivicaine and other local anesthetics (U.S. Patent 5,593,972 incorporated 
herein by reference in their entirety), saponins (U.S. Patent 5,739,118 incorporated herein by 
reference in their entirety) and cationic polyamines (published international application WO 96/10038 
incorporated herein by reference in their entirety). 
10 The DNA sequence encoding the antigenic polypeptide and regulatory element may 

be inserted into a stable cell line or cloned microorganism, using techniques, such as targeted 
homologous recombination, which are well known to those of skill in the art, and described e.g., in 
Chappel, U.S. Patent No. 4,215,051; Skoultchi, WO 91/06667 each of which is incorporated herein by 
reference in its entirety. 

15 Such cell lines and microorganisms may be formulated for vaccine purposes. In yet 

another embodiment, the DNA sequence encoding the antigenic polypeptide and regulatory element 
may be delivered to a mammalian host and introduced into the host genome via homologous 
recombination (See, Chappel, U.S. Patent No. 4,215,051; Skoultchi, WO 91/06667 each of which is 
incorporated herein by reference in its entirety. 

20 Preferably, the immunogenic and/or vaccine compositions according to the invention 

intended for the prevention and/or the treatment of an infection by Chlamydia trachomatis or by an 
associated microorganism will be chosen from the immunogenic and/or vaccine compositions 
comprising a polypeptide or one of its representative fragments corresponding to a protein, or one of 
its representative fragments, of the cellular envelope of Chlamydia trachomatis. The vaccine 

25 compositions comprising nucleotide sequences will also preferably comprise nucleotide sequences 
encoding a polypeptide or one of its fragments corresponding to a protein, or one of its representative 
fragments, of the cellular envelope of Chlamydia trachomatis. 

Among these preferred immunogenic and/or vaccine compositions, the most preferred 
are those comprising a polypeptide or one of its representative fragments, or a nucleotide sequence or 

30 one of its representative fragments whose sequences are chosen from the nucleotide or amino acid 
sequences identified in this functional group and listed above. 

The polypeptides of the invention or their representative fragments entering into the 
immunogenic compositions according to the invention may be selected by techniques known to 
persons skilled in the art, such as for example on the capacity of the said polypeptides to stimulate T 

35 cells, which results, for example, in their proliferation or the secretion of interleukins, and which 
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leads to the production of antibodies directed against the said polypeptides. 

In mice, in which a weight dose of the vaccine composition comparable to the dose 
used in humans is administered, the antibody reaction is tested by collecting serum followed by a 
study of the formation of a complex between the antibodies present in the serum and the antigen of 
5 the vaccine composition, according to the customary techniques. 

According to the invention, the said vaccine compositions will be preferably in 
combination with a pharmaceutically acceptable vehicle and, where appropriate, with one or more 
appropriate immunity adjuvants. 

Various types of vaccines are currently available for protecting humans against 

10 infectious diseases: attenuated live microorganisms (M bovis - BCG for tuberculosis), inactivated 
microorganisms (influenza virus), acellular extracts (Bordetella pertussis for whooping cough), 
recombinant proteins (hepatitis B virus surface antigen), polysaccharides (pneumococci). Experiments 
are underway on vaccines prepared from synthetic peptides or from genetically modified 
microorganisms expressing heterologous antigens. Even more recently, recombinant plasmid DNAs 

15 carrying genes encoding protective antigens were proposed as an alternative vaccine strategy. This 
type of vaccination is carried out with a particular plasmid derived from an E. coli plasmid which 
does not replicate in vivo and which encodes only the vaccinal protein. Animals were immunized by 
simply injecting the naked plasmid DNA into the muscle. This technique leads to the expression of 
the vaccine protein in situ and to a cell-type (CTL) and a humoral type (antibody) immune response. 

20 This double induction of the immune response is one of the main advantages of the technique of 
vaccination with naked DNA. 

The vaccine compositions of the present invention can be evaluated in in vitro and in 
vivo animal models prior to host, e^, human, administration. For example, in vitro neutralization 
assays such as those described by Peterson et al. (1988) can be utilized. The assay described by 

25 Peterson et al. (1988) is suitable for testing vaccine compositions directed toward either Chlamydia 
trachomatis or Chlamydia pneumoniae. 

Briefly, hyper-immune antisera is diluted in PBS containing 5% guinea pig serum, as 
a complement source. Chlamydiae (10 4 IFU; inclusion forming units) are added to the antisera 
dilutions. The antigen-antibody mixtures are incubated at 37°C for 45 minutes and inoculated into 

30 duplicate confluent Hep-2 or HeLa cell monolayers contained in glass vials (e.g. , 15 by 45 mm), 
which have been washed twice with PBS prior to inoculation. The monolayer cells are infected by 
centrifugation at 1000X g for 1 hour followed by stationary incubation at 37° for 1 hour. Infected 
monolayers are incubated for 48 or 72 hours, fixed and stained with a Chlamydiae specific antibody, 
such as anti-MOMP for C. trachomatis, etc. Inclusion-bearing cells are counted in ten fields at a 

35 magnification of 200X. Neutralization titer is assigned based on the dilution that gives 50% 
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inhibition as compared to control monolayers/IFU. 

The efficacy of vaccine compositions can be determined in vivo by challenging 
animal models of Chlamydia trachomatis infection, e.g., guinea pigs or mice, with the vaccine 
compositions. For example, in vivo vaccine composition challenge studies in the guinea pig model of 
5 Chlamydia trachomatis infection can be performed. Briefly, female guinea pigs weighing 450 to 500 
g are housed in an enviornmentally controlled room with a 12 hour light-dark cycle and immunized 
with vaccine compositions via a variety of immunization routes. Post-vaccination, guinea pigs are 
infected in the genital tract with the agent of guinea pig inclusion conjunctivitis (GPIC), which has 
been grown in HeLa or McCoy cells (Rank et al. (1988)). Each animal receives approximately 

10 1 .4x 10 7 inclusion forming units (IFU) contained in 0.05 ml of sucrose-phosphate-glutamate buffer, pH 
7.4 (Schacter, J. (1980)). The course of infection monitored by determining the percentage of 
inclusion-bearing cells by indirect immunofluorescence with GPIC specific antisera, or by Giemsa- 
stained smear from a scraping from the genital tract (Rank et al. (1988)). Antibody titers in the serum 
is determined by an enzyme-linked immunosorbent assay. 

15 Alternatively, in vivo vaccine composition challenge studies can be performed in the 

murine model of Chlamydia trachomatis (Morrison et al., 1995). Briefly, female mice 7 to 12 weeks 
of age receive 2.5 mg of depoprovera subcutaneously at 10 and 3 days before vaginal infection. Post- 
vaccination, mice are infected in the genital tract with 1,500 inclusion-forming units of Chlamydia 
trachomatis contained in 5ml of sucrose-phosphate-glutamate buffer, pH. 7.4. The course of infection 

20 is monitored by determining the percentage of inclusion-bearing cells by indirect immunofluorence 
with Chlamydia trachomatis specific antisera, or by a Giemsa-stained smear from a scraping from the 
genital tract of an infected mouse. The presence of antibody titers in the serum of a mouse is 
determined by an enzyme-linked immunosorbent assay. 

The vaccine compositions comprising nucleotide sequences or vectors into which the 

25 said sequences are inserted are in particular described in International Application No. WO 90/1 1092 
and also in International Application No. WO 95/1 1307. 

The nucleotide sequence constituting the vaccine composition according to the 
invention may be injected into the host after having been coupled to compounds which promote the 
penetration of this polynucleotide inside the cell or its transport up to the cell nucleus. The resulting 

30 conjugates may be encapsulated into polymeric microparticles, as described in International 
Application No. WO 94/27238 (Medisorb Technologies International). 

According to another embodiment of the vaccine composition according to the 
invention, the nucleotide sequence, preferably a DNA, is complexed with the DEAE-dextran (Pagano 
et al., 1967) or with nuclear proteins (Kaneda et al., 1989), with lipids (Feigner et al., 1987) or 

35 encapsulated into liposomes (Fraley et al., 1980) or alternatively introduced in the form of a gel 
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facilitating its transfection into the cells (Midoux et al., 1993, Pastore et al., 1994). The 
polynucleotide or the vector according to the invention may also be in suspension in a buffer solution 
or may be combined with liposomes. 

Advantageously, such a vaccine will be prepared in accordance with the technique 
5 described by Tacson et al. or Huygen et al. in 1996 or alternatively in accordance with the technique 
described by Davis et al. in International Application No. WO 95/1 1307. 

Such a vaccine may also be prepared in the form of a composition containing a vector 
according to the invention, placed under the control of regulatory elements allowing its expression in 
humans or animals. It is possible, for example, to use, as vector for the in vivo expression of the 

10 polypeptide antigen of interest, the plasmid pcDNA3 or the plasmid pcDNAl/neo, both marketed by 
Invitrogen (R&D Systems, Abingdon, United Kingdom). It is also possible to use the plasmid 
VUns.tPA, described by Shiver et al. in 1995. Such a vaccine will advantageously comprise, in 
addition to the recombinant vector, a saline solution, for example a sodium chloride solution. 

The immunogenic compositions of the invention can be utilized as part of methods of 

15 immunization, wherein such methods comprise administering to a host, e.g., a human host, an 
immunizing amount of the immunogenic compositions of the invention. In a preferred embodiment, 
the method of immunizing is a method of immunizing against Chlamydia trachomatis. 

A pharmaceutical^ acceptable vehicle is understood to designate a compound or a 
combination of compounds entering into a pharmaceutical or vaccine composition which does not 

20 cause side effects and which makes it possible, for example, to facilitate the administration of the 
active compound, to increase its life and/or its efficacy in the body, to increase its solubility in 
solution or alternatively to enhance its preservation. These pharmaceutically acceptable vehicles are 
well known and will be adapted by persons skilled in the art according to the nature and the mode of 
administration of the active compound chosen. 

25 As regards the vaccine formulations, these may comprise appropriate immunity 

adjuvants which are known to persons skilled in the art, such as, for example, aluminium hydroxide, a 
representative of the family of muramyl peptides such as one of the peptide derivatives of N-acetyl- 
muramyl, a bacterial lysate, or alternatively incomplete Freund ' s adjuvant, Stimulon™ QS-21 (Aquila 
Biopharmaceuticals, Inc., Framinham, MA), MPL™ (3-0-deacylated monophosphoryl lipid A; RIBI 

30 ImmunoChem Research, Inc., Hamilton, MT), aluminum phosphate, IL-12 (Genetics Institute, 
Cambridge, MA). 

Preferably, these compounds will be administered by the systemic route, in particular 
by the intravenous route, by the intranasal, intramuscular, intradermal or subcutaneous route, or by the 
oral route. More preferably, the vaccine composition comprising polypeptides according to the 
35 invention will be administered several times, spread out over time, by the intradermal or subcutaneous 
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route. 

Their optimum modes of administration, dosages and galenic forms may be 
determined according to criteria which are generally taken into account in establishing a treatment 
adapted to a patient, such as for example the patient's age or body weight, the seriousness of his 
5 general condition, tolerance of the treatment and the side effects observed. 

The invention comprises the use of a composition according to the invention for the 
treatment or the prevention of genital diseases which are induced or worsened by Chlamydia 
trachomatis. 

Finally, the invention comprises the use of a composition according to the invention 
10 for the treatment or the prevention of eye diseases which are induced or worsened by the presence of 
Chlamydia trachomatis. 

Finally, the invention comprises the use of a composition according to the invention 
for the treatment or the prevention of systemic diseases, especially of the lymphatic system, which are 
induced or worsened by the presence of Chlamydia trachomatis. 
15 Other characteristics and advantages of the invention appear in the following 

examples and figures: 

Legend to the figures : 

Figure 1 : Line for the production of Chlamydia trachomatis sequences 
20 Figure 2 : Analysis of the sequences and assembling 
Figure 3 : Finishing techniques 

Figure 3a) : Assembly map 

Figure 3b) : Determination and use of the orphan ends of the contigs 

25 EXAMPLES 

Cells 

The Chlamydia trachomatis LGV2 strain used is identified to have over 98% 
homology with the outer membrane protein sequences ompl (CHTMOMPA) and omp2 
(CHTOMP2A) of the Chlamydia trachomatis serovar L2/434/Bu strain. 
30 The Chlamydia trachomatis LGV2 strain is cultured on mouse fibroblasts (McCoy 

cells), obtained from the American Type Culture Collection, under the reference ATCC CRL-1696. 

Culture of the cells 

The mouse fibroblasts are cultured in 75-ml cell culture flasks (Corning). The culture 
35 medium is Dulbecco ' s modified cell culture medium (Gibco BRL No. 04101965) supplemented with 
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MEM amino acids (Gibco BRL - No. 04301140) L (5 ml per 500 ml of medium) and 5% foetal calf 
serum (Gibco BRL No. 10270 batch 40G8260K) without antibiotics or antifungals. 

The cell culture stock is maintained in the following manner. The cell cultures are 
examined under an inverted microscope. 24 hours after confluence, each cellular lawn is washed with 
5 PBS (Gibco BRL No. 041 14190), rinsed and then placed for 5 min in an oven in the presence of 3 ml 
of trypsine (Gibco BRL No. 25200056). The cellular lawn is then detached and then resuspended in 
120 ml of culture medium, the whole is stirred in order to make the cellular suspension homogeneous. 
30 ml of this suspension are then distributed per cell culture flask. The flasks are kept in a C0 2 oven 
(5%) for 48 hours at a temperature of 37°C. The cell stock is maintained so as to have available daily 
10 16 flasks of subconfluent cells. It is these subconfluent cells which will be used so as to be infected 
with Chlamydia. 25-ml cell culture flasks are also used, these flasks are prepared in a similar manner 
but the volumes used for maintaining the cells are the following: 1 ml of trypsine, 28 ml of culture 
medium to resuspend the cells, 7 ml of culture medium are used per 25-ml flask. 

15 Infection of the cells with Chlamydia 

Initially, the Chlamydiae are obtained frozen (at -70°C), in suspension in a volume of 
1 millilitre. This preparation is slowly thawed, 500 ul are collected and brought into contact with 
subconfluent cells, which are obtained as indicated above, in a 25-ml cell culture flask, containing 
1 ml of medium, so as to cover the cells. The flask is then centrifuged at 2000 rpm in a «swing» rotor 

20 for microtitre plates, the centrifuge being maintained at a temperature of 35°C. After centrifugation, 
the two flasks are placed in an oven at 35°C for three hours. 6 ml of culture medium containing 
cycloheximide (1 ug/ml) are then added and the flask is stored at 35°C. After 48 hours, the level of 
infection is evaluated by direct immunofluorescence and by the cytopathogenic effect caused to the 
cells. 

25 

Direct immunofluorescence 

Starting with infected cells, which were obtained as indicated above, a cellular smear 
is deposited with a Pasteur pipette on a microscope slide. The cellular smear is fixed with acetone for 
10 minutes; after draining the acetone, the smear is covered with 30 ul of murine monoclonal 

30 antibodies directed against MOMP (major outer membrane protein) of Chlamydia (Syva, Biomerieux) 
labelled with fluorescein isothiocyanate. The whole is then incubated in a humid chamber at a 
temperature of 37°C. The slides are then rinsed with water, slightly dried, and then after depositing a 
drop of mounting medium, a coverslip is mounted before reading. The reading is carried out with the 
aid of a fluorescence microscope equipped with the required filters (excitation at 490 nm, emission at 

35 520 nm). 
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Harvesting of the Chlamydia trachomatis 

After checking the infection by direct immunofluorescence, carried out as indicated 
above, the culture flasks are opened under a sterile cabinet, sterile glass beads with a diameter of the 
5 order of a millimeter are placed in the flask. The flask is closed and then vigorously stirred while 
being maintained horizontally, the cellular lawn at the bottom, so that the glass beads can have a 
mechanical action on the cellular lawn. Most of the cells are thus detached or broken; the effect of the 
stirring is observed under an optical microscope so as to ensure proper release of Chlamydiae. 

10 Large-scale infection of the cell cultures 

The product of the Chlamydiae harvest (culture medium and cellular debris) is 
collected with a pipette, and distributed into three cell culture flasks containing subconfluent L cells, 
obtained as indicated above. The cells thus inoculated are placed under gentle stirring (swing) in an 
oven at 35°C. After one hour, the flasks are kept horizontally in an oven so that the culture medium 

15 covers the cells for 3 hours. 30 ml of culture medium containing actydione (1 ug/ml) are then added to 
each of the flasks. The culture flasks are then stored at 35°C for 48 hours. The cells thus infected are 
examined under an optical microscope after 24 hours, the cytopathogenic effect is evaluated by the 
appearance of cytoplasmic inclusions which are visible under an inverted optical microscope. After 
48 hours, the vacuoles containing the Chlamydiae occupy the cytoplasm of the cell and push the cell 

20 nucleus sideways. At this stage, numerous cells are spontaneously destroyed and have left free 
elementary bodies in the culture medium. The Chlamydiae are harvested as described above and are 
either frozen at -80°C or used for another propagation. 

Purification of the Chlamydiae 

25 The product of the Chlamydia harvests, stored at -80°C, is thawed on a water bath at 

room temperature. After thawing, each tube is vigorously stirred for one minute and immersed for one 
minute in an ultrasound tank (BRANSON 1200); the tubes are then stirred by inverting before being 
centrifuged for 5 min at 2000 rpm. The supernatant is carefully removed and kept at cold temperature 
(ice). The supernatant is vigorously stirred and then filtered on nylon filters having pores of 5 microns 

30 in diameter on a support (Nalgene) allowing a delicate vacuum to be established under the nylon 
filter. For each filtration, three nylon filters are superposed; these filters are replaced after every 40 ml 
of filtrate. Two hundred milliliters of filtration product are kept at cold temperature, and then after 
stirring by inverting, are centrifuged at 10,000 rpm for 90 min, the supernatant is removed and the 
pellet is taken up in 10 ml of 10 mM Tris, vigorously vortexed and then centrifuged at 10,000 rpm for 

35 90 min. The supernatant is removed and the pellet is taken up in a buffer (20 mM Tris pH 8.0, 50 mM 
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KC1, 5 mM MgCl 2 ) to which 800 units of DNAse I (Boehringer) are added. The whole is kept at 37°C 
for one hour. One ml of 0.5 M EDTA is then added, and the whole is vortexed and frozen at -20°C. 

Preparation of the DNA 

5 The Chlamydiae purified above are thawed and subjected to a proteinase K 

(Boehringer) digestion in a final volume of 10 ml. The digestion conditions are the following: 
0.1 mg/ml proteinase K, 0.1 % SDS at 55°C, stirring every 10 min. The product of digestion is then 
subjected to a double extraction with phenol-chloroform, two volumes of ethanol are added and the 
DNA is directly recovered with a Pasteur pipette having one end in the form of a hook. The DNA is 
10 dried on the edge of the tube and then resuspended in 500 ul of 2 mM Tris pH 7.5. The DNA is stored 
at 4°C for at least 24 hours before being used for the cloning. 

Cloning of the DNA 

After precipitation, the DNA is quantified by measuring the optical density at 

15 260 nm. Thirty ug of Chlamydia DNA are distributed into 10 tubes of 1.5 ml and diluted in 300 jal of 
water. Each of the tubes is subjected to 10 applications of ultrasound lasting for 0.5 sec in a sonicator 
(Unisonix XL2020). The contents of the 10 tubes are then grouped and concentrated by successive 
extractions with butanol (Sigma B1888) in the following manner: two volumes of butanol are added 
to the dilute DNA mixture. After stirring, the whole is centrifuged for five minutes at 2500 rpm and 

20 the butanol is removed. This operation is repeated until the volume of the aqueous phase is less than 
1 ml. The DNA is then precipitated in the presence of ethanol and of 0.5 M sodium acetate pH 5.4, 
and then centrifuged for thirty minutes at 15,000 rpm at cold temperature (4°C). The pellet is washed 
with 75% ethanol, centrifuged for five minutes at 15,000 rpm and dried at room temperature. A tenth 
of the preparation is analysed on a 0.8% agarose gel. Typically, the size of the DNA fragments thus 

25 prepared is between 200 and 8000 base pairs. 

To allow the cloning of the DNA obtained, the ends are repaired. The DNA is 
distributed in an amount of 10 ug/tube, in the following reaction medium: 100 ul final volume, 1 H 
buffer (Biolabs 201L), 0.5 pi BSA 0.05 mg/ml, 0.1 mM dATP, 0.1 mM each of dGTP, dCTP or 
dTTP, 60,000 IU T4 DNA polymerase. The reaction is incubated for thirty minutes at 16°C. The 

30 contents of each of the tubes are then grouped before carrying out an extraction with phenol- 
chloroform and then precipitating the aqueous phase as described above. After this step, the DNA 
thus prepared is phosphorylated. For that, the DNA is distributed into tubes in an amount of 10 ug per 
tube, and then in a final volume of 50 pi, the reaction is prepared in the following manner: 1 mM 
ATP, 1 x kinase buffer, 10 IU T4 polynucleotide kinase (Biolabs 201L). The preparation is incubated 

35 for thirty minutes at 37°C. The contents of the tubes are combined and a phenol-chloroform extraction 
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and then a precipitation are carried out in order to precipitate the DNA. The latter is then suspended in 
1 pi of water and then the DNA fragments are separated according to their size on a 0.8% agarose gel 
(1 x TAE). The DNA is subjected to an electric field of 5 V/cm and then visualized on a UV table. 
The fragments whose size varies between 1200 and 2000 base pairs are selected by cutting out the gel. 
5 The gel fragment thus isolated is placed in a tube and then the DNA is purified with the Qiaex kit 
(20021 Qiagen), according to the procedure provided by the manufacturer. 

Preparation of the vector 

14 ug of the cloning vector pGEM-5Zf (Promega P2241) are diluted in a final volume 

10 of 150 pi and are subjected to digestion with the restriction enzyme EcoRV 300 IU (Biolabs 195S) 
according to the protocol and with the reagents provided by the manufacturer. The whole is placed at 
37°C for 150 min and then distributed in the wells of a 0.8% agarose gel subjected to an electric field 
of 5 V/cm. The linearized vector is visualized on a UV table, isolated by cutting out the gel and then 
purified by the Qiaex kit (Qiagen 20021) according to the manufacturer ' s recommendations. The 

15 purification products are grouped in a tube, the volume is measured and then half the volume of 
phenol is added and the whole is vigorously stirred for 1 min. Half the volume of chloroform-isoamyl 
alcohol 24:1 is added and vigorously stirred for 1 min. The whole is centrifuged at 15,000 rpm for 
5 min at 4°C, the aqueous phase is recovered and transferred into a tube. The DNA is precipitated in 
the presence of 0.3 M sodium acetate, pH 5.4 and 3 volumes of ethanol and placed at -20°C for 

20 1 hour. The DNA is then centrifuged at 15,000 rpm for 30 min at 4°C, the supernatant is removed 
while preserving the pellet, washed twice with 70% ethanol. After drying at room temperature, the 
DNA is suspended in 25 pi of water. 

Phosphorylation of the vector 

25 25 H 1 of the vector prepared in the preceding step are diluted in a final volume of 

500 pi of the following reaction mixture: 

After repair, the DNA is subjected to a phenol-chloroform extraction and a 
precipitation, the pellet is then taken up in 10 pi of water, the DNA is quantified by measuring the 
optical density at 260 nm. The quantified DNA is ligated into the vector PGEm-5Zf(+) prepared by 

30 the restriction enzyme EcoRV and dephosphorylated (see preparation of the vector). The ligation is 
carried out under three conditions which vary in the ratio between the number of vector molecules and 
the number of insert molecules. Typically, an equimolar ratio, a ratio of 1:3 and a ratio of 3:1 are used 
for the ligations which are, moreover, carried out under the following conditions: vector 
PGEm-5Zf(+) 25 ng, cut DNA, ligation buffer in a final volume of 20 pi with T4 DNA ligase 

35 (Amersham E70042X); the whole is then placed in a refrigerator overnight and then a phenol- 
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chloroform extraction and a precipitation are carried out in a conventional manner. The pellet is taken 
up in 5 pi of water. 

Transformation of the bacteria 
5 Plating of the bacteria 

Petri dishes containing LB Agar medium containing ampicillin (50 pg/ml), Xgal 
(280 pg/ml) [5-bromo-4-chloro-indolyl-beta-D-galactopyranoside (Sigma B-4252)], IPTG (140 ug/ml) 
[isopropyl-beta-D-thiogalactoside (Sigma 1-6758)] are used, 50 and 100 pi of bacteria are plated for 
each of the ligations. The Petri dishes are placed upside down at 37°C for 15 to 16 hours in an oven. 
10 The number of «recombinant» positive clones is evaluated by counting the white colonies and the 
blue colonies which are thought to contain the vector alone. 

Evaluation of the «recombinant» positive clones : 

Ninety-four white colonies and two blue colonies are collected with the aid of sterile 
15 cones and are deposited at the bottom of the wells of plates designed for carrying out the 
amplification techniques. 30 pi of the following reaction mixture are added to each well: 1.7 mM 
MgCl 2l 0.2 mM each of dATP, dCTP, dGTP and dTTP, two synthetic oligonucleotides corresponding 
to sequences flanking the cloning site on either side and orienting the synthesis of the DNA in a 
convergent manner (0.5 pM RP and PU primers, 1 U TAQ polymerase (GibcoBRL 18038-026)). 
20 The colonies thus prepared are subjected to a temperature of 94°C for 5 min and then 

to 30 thermal cycles composed of the following steps: 94°C for 40 s, 50°C for 30 s, 72°C for 180 s. 
The reaction is then kept for 7 min at 72°C and then kept at 4°C. 

The amplification products are deposited on an agarose gel (0.8%), stained with 
ethidium bromide, subjected to electrophoresis, and then analysed on an ultraviolet table. The 
25 presence of an amplification fragment having a size greater than 500 base pairs indicates the presence 
of an insert. The bacterial clones are then prepared so as to study the sequence of their insert. 

Sequencing 

To sequence the inserts of the clones obtained as above, these were amplified by PCR 
30 on bacteria cultures carried out overnight using the primers for the vectors flanking the inserts. The 
sequence of the ends of these inserts (on average 500 bases on each side) was determined by 
automated fluorescent sequencing on an ABI 377 sequencer, equipped with the ABI Prism DNA 
Sequencing Analysis software (version 2.1.2). 
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Analysis of the sequences 
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The sequences obtained by sequencing in a high-yield line (Figure 1) are stored in a 
database; this part of the production is independent of any treatment of the sequences. The sequences 
are extracted from the database, avoiding all the regions of inadequate quality, that is to say the 
regions for which uncertainties are observed on the sequence at more than 95%. After extraction, the 
5 sequences are introduced into a processing line, the diagram of which is described in Figure 2. In a 
first path of this processing line, the sequences are assembled by the Gap4 software from R. Staden 
(Bonfield et al., 1995) (OS UNIX/SUN Solaris); the results obtained by this software are kept in the 
form of two files which will be used for a subsequent processing. The first of these files provides 
information on the sequence of each of the contigs obtained. The second file represents all the clones 

10 participating in the composition of all the contigs as well as their positions on the respective contigs. 

The second processing path uses a sequence assembler (TIGR-Asmg assembler 
UNIX/SUN Solaris); the results of this second processing path are kept in the form of a file in the 
TIGR-Asmg format which provides information on the relationship existing between the sequences 
selected for the assembly. This assembler is sometimes incapable of linking contigs whose ends 

15 overlap over several hundreds of base pairs. 

The results obtained from these two assemblers arc compared with the aid of the 
BLAST program, each of the contigs derived from one assembly path being compared with the 
contigs derived from the other path. 

For the two processing paths, the strict assembly parameters are fixed (95% 

20 homology, 30 superposition nucleotides). These parameters avoid 3 to 5% of the clones derived from 
eukaryotic cells being confused with sequences obtained from the clones derived from Chlamydia 
trachomatis. The eukaryotic sequences are however preserved during the course of this project; the 
strategy introduced, which is described below, will be designed, inter alia, not to be impeded by these 
sequences derived from contaminating clones. 

25 The results of these two assemblers are processed in a software developed for this 

project. This software operates on a Windows NT platform and receives, as data, the results derived 
from the STADEN software and/or the results derived from the TIGR-Asmg assembler, the software, 
results, after processing of the data, in the determination of an assembly map which gives the 
proximity relationship and the orientation of the contigs in relation to one another (Figure 3a). Using 

30 this assembly map, the software determines all the primers necessary for finishing the project. This 
treatment, which will be detailed below, has the advantage of distinguishing the isolated sequences 
derived from the contaminations, by the DNA eukaryotic cells, of the small-sized sequences clearly 
integrated into the project by the relationships which they establish with contigs. In order to allow, 
without any risk of error, the arrangement and the orientation of the contigs in relation to one another, 

35 a statistical evaluation of the accuracy of the names «naming» of sequence is made from the results of 
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«contigation». This evaluation makes it possible to give each of the clone plates, as well as each of 
the subsets of plates, a weight which is inversely proportional to probable error rate existing in the 
«naming» of the sequences obtained from this plate or from a subset of this plate. In spite of a low 
error rate, errors may occur throughout the steps of production of the clones and of the sequences. 
5 These steps are numerous, repetitive and although most of them are automated, others, like the 
deposition in the sequencers, are manual; it is then possible for the operator to make mistakes such as 
the inversion of two sequences. This type of error has a repercussion on the subsequent processing of 
the data, by resulting in relationships (between the contigs) which do not exist in reality, then in 
attempts at directed sequencing between the contigs which will end in failure. It is because of this that 

10 the evaluation of the naming errors is of particular importance since it allows the establishment of a 
probabilistic assembly map from which it becomes possible to determine all the clones which will 
serve as template to obtain sequences separating two adjacent contigs. Table 2 of parent U.S. 
Application Serial No. 60/107077 filed November 4, 1998, French application 97-15041 filed 
November 28, 1997 and French application 97-16034 filed December 17, 1997, each of which is 

15 incorporated by reference herein in its entirety, gives the clones and the sequences of the primers 
initially used during the initial operations. 

To avoid the step which consists in ordering and then preparing the clones by 
conventional microbiological means, outer and inner primers oriented towards the regions not yet 
sequenced are defined by the software. The primers thus determined make it possible to prepare, by 

20 PCR, a template covering the nonsequenced region. It is the so-called outer primers (the ones most 
distant from the region to be sequenced) which are used to prepare this template. The template is then 
purified and a sequence is obtained on each of the two strands during 2 sequencing reactions which 
each use one of the 2 inner primers. In order to facilitate the use of this approach, the two outer 
primers and the two inner primers are prepared and then stored on the same position of 4 different 

25 96-well plates. The two plates containing the outer primers are used to perform the PCRs which serve 
to prepare the templates. These templates will be purified on purification columns preserving the 
topography of the plates. Each of the sequences are obtained using primers situated on one and then 
on the other of the plates containing the inner primers. This distribution allows a very extensive 
automation of the process and results in a method which is simple to use for finishing the regions not 

30 yet sequenced. Table 3 of parent U.S. Application Serial No. 60/107077 filed November 4, 1998, 
French application 97-15041 filed November 28, 1997 and French application 97-16034 filed 
December 17, 1997, each of which is incorporated by reference herein in its entirety, gives the names 
and the sequences of the primers used for finishing Chlamydia trachomatis. 

Finally, a number of contigs exist in a configuration where one of their ends is not 

35 linked to any other contig end (Figure 3b) by a connecting clone relationship (a connecting clone is 
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defined as a clone having one sequence end on a contig and the other end of its sequence on another 
contig; furthermore, this clone must be derived from a plate or a subset of plates with adequate 
naming quality). For the Chlamydia trachomatis project, this particular case occurred 37 times. Two 
adjacent PCR primers orienting the synthesis of the DNA towards the end of the consensus sequence 
5 are defined for each of the orphan ends of the consensus sequence. The primer which is closest to the 
end of the sequence is called the inner primer whereas the primer which is more distant from the end 
of the sequence is called the outer primer. The outer primers are used to explore the mutual 
relationship between the orphan ends of the different contigs. The presence of a single PCR product 
and the possibility of amplifying this product unambiguously using the inner primers evokes the 
10 probable relationship between the contigs on which the primers which allowed the amplification are 
situated. This relationship will be confirmed by sequencing and will allow the connection between the 
orphan ends of the consensus sequences. This strategy has made it possible to obtain a complete map 
of the Chlamydia trachomatis chromosome and then to finish the project. 

15 Quality control 

All the bases not determined with certainty in the chromosomal sequence were noted 
and the density of uncertainties was measured on the entire chromosome. The regions with a high 
density of uncertainties were noted and the PCR primers spanning these regions were drawn and are 
represented in Table 4 of parent U.S. Application Serial No. 60/107077 filed November 4, 1998, 

20 French application 97-15041 filed November 28, 1997 and French application 97-16034 filed 
December 17, 1997 each of which is incorporated by reference herein in its entirety. 

Data banks 

Local reorganizations of major public banks were used. The protein bank used 
25 consists of the nonredundant fusion of the Genpept bank (automated translation of GenBank, NCBI; 
Benson et al., 1996). 

The entire BLAST software (public domain, Altschul et al., 1990) for searching for 
homologies between a sequence and protein or nucleic data banks was used. The significance levels 
used depend on the length and the complexity of the region tested as well as the size of the reference 
30 bank. They were adjusted and adapted to each analysis. 

The results of the search for homologies between a sequence according to the 
invention and protein or nucleic data banks are presented and summarized in Table 1 below. 
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Table 1: List of coding chromosome regions and homologies between these regions and the sequence 
banks. 
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Legend to Table 1: Open reading frames are identified with the GenMark software 
version 2.3A (GenePro), the template used is Chlamydia trachomatis of order 4 on a length of 
196 nucleotides with a window of 1 2 nucleotides and a minimum signal of 0.5. These reading frames 
are numbered in order of appearance on the chromosome, starting with ORF2 (ORF column). The 
5 positions of the beginning and of the end are then given in column 2 (position). When the position of 
the beginning is greater than the position of the end, this means that the region is encoded by the 
strand complementary to the sequence which was given in the sequence SEQ ID No. 1. 

All the putative products were subjected to a search for homology on GENPEPT 
(release 103 for SEQ ID No. 2 to SEQ ID No. 1076 and release 108 for SEQ ID No. 1077 to SEQ ID 

10 No. 1197 with the BLASTp software (Altschul etal. 1990), with, as parameters, the default 
parameters with the exception of the expected value E set at 10 s (for SEQ ED No. 2 to SEQ ID No. 
1076) and P value set at e 10 (for SEQ ID No. 1077 to SEQ ID No. 1197). Subsequently, only the 
identities greater than 30% (1% column) were taken into account. The description of the most 
homologous sequence is given in the Homology column; the identifier for the latter sequence is given 

15 in the ID column and the animal species to which this sequence belongs is given in the Species 
column. The Homology score is evaluated by the sum of the blast scores for each region of homology 
and reported in the Score column. Table 1 also reflects data from additional ORF finder programs as 
defined below. 



20 Materials and methods: transmembrane domains : 

The DAS software was used as recommended by the authors (Cserzo et al., 1997). 

This method uses, to predict the transmembrane domains, templates derived from a 
sampling of selected proteins. All the regions for which a «Cutoff» greater than 1.5 was found by the 
program were taken into account. 

25 

Additional ORF Finder Programs 

For this analysis, two additional ORF finder programs were used to predict potential 
open reading frames of a minimum length of 74 amino acids; Glimmer (Salzberg, S.L., Delcher, A., 
Kasif, S., and W. White. 1998. Microbial gene identification using interpolated Markov models. 
30 Nucleic Acids Res. 26:544-548.), and an in-house written program. The in-house program used a 
very simple search algorithm. The analysis required the that the genomic DNA sequence text be in 
the 5' to 3' direction, the genome is circular, and that TAA, TAG, and TGA are stop codons. The 
search parameters were as follows: 

(1) A search for an ORF that started with a GTG codon was performed. If no GTG codons 
35 were found, then a search for an ATG codon was performed. However, if a GTG codon was found, 
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then a search downstream for a ATG codon was performed. All start and stop nucleotide positions 
were recorded. 

(2) A search for an ORF that started with a TTG codon was performed. If no TTG codons 
were found, then a search for a ATG codon was performed. However, if a TTG codon was found, 
then a search downstream for a ATG codon was performed. All start and stop nucleotide positions 
were recorded. 

(3) The analysis described in steps 1 and 2 were repeated for the opposite strand of DNA 
sequence. 

(4) A search for ORFs that determined all ORF lengths using start and stop positions in the 
same reading frames was performed. 

(5) All ORFs whose DNA length was less than 225 nucleotides were eliminated from the 

search. 

Surface Exposed Protein Search Criteria 

Potential cell surface vaccine targets are outer membrane proteins such as porins, 
lipoproteins, adhesions and other non-integral proteins. In Chlamydia psittaci, the major 
immunogens is a group of putative outer membrane proteins (POMPs) and no homologs have been 
found in Chlamydia trachomatis and Chlamydia trachomatis by traditional analysis (Longbottom, D., 
Russell, M., Dunbar, S.M., Jones, G.E., and A.J. Herring. 1998. Molecular Cloning and 
Characterization of the Genes Coding for the Highly Immunogenic Cluster of 90-Kilodalton Envelope 
Proteins from Chlamydia psittaci Subtype That Causes Abortion in Sheep. Infect Immun 66:1317- 
1324.) However, utilizing the criteria described below, several ORFs encoding outer membrane 
proteins have been identified in Chlamydia trachomatis, all of which may represent vaccine 
candidates. Any ORF which met any one of the criteria described below were considered to encode a 
surface exposed protein. 

Protein homology searches of the translated ORFs were done using the Blastp 2.0 tool 
(Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J„ Zhang, Z., Miller, W., and DJ. Lipman. 
1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. 
Nucleic Acids Res. 25:3389-3402). An ORF was labeled surface exposed if the translated ORF had 
homology to a known, or hypothetical, or putative surface exposed protein with a P score less than 
e" 10 . 

Most, if not all, proteins that are localized to the membrane of bacteria, via a 
secretory pathway, contain a signal peptide. The software program SignalP, was used to analyze the 
amino acid sequence of an ORF for such a signal peptide (Nielsen, H., Engelbrecht. J., Brunak, S., 
and G. von Heijne. 1997. Identification of prokaryotic and eukaryotic signal peptides and prediction 
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of their cleavage sites. Protein Engineering 10:1-6.) The first 60 N-terminal amino acids of each ORF 
were analyzed by SignalP using the Gram-negative software database. The output generates four 
separate values, maximum C, maximum Y, maximum S, and mean S. The S-score, or signal region, is 
the probability of the position belonging to the signal peptide. The C-score, or cleavage site, is the 
5 probability of the position being the first in the mature protein. The Y-score is the geometric average 
of the C-score and a smoothed derivative of the S-score. A conclusion of either a Yes or No is given 
next to each score. If all four conclusions are Yes and the C-terminal amino acid is either a 
phenylalanine (F) or a tyrosine (Y), the ORF was labelled outer membrane (Struyve, M., Moons, M., 
and J. Tommassen. 1991. Carboxy-terminal Phenylalanine is Essential for the Correct Assembly of a 

10 Bacterial Outer Membrane Protein. J. Mol. Biol. 218:141-148.) 

The program called Psort was used to determine the localization of a protein based on 
its signal sequence, recognition of transmembrane segments, and analysis of its amino acid 
composition (Nakai, K., and M. Kanehisa. 1991. Expert system for predicting protein localization 
sites in gram-negative bacteria. Proteins 1 1:95-1 10.) An ORF is considered to be an outer membrane 

15 protein if the output data predicts the ORF encoded protein as outer membrane with a certainty value 
of 0.5 or better and whose value is at least twice as large as the next predicted localized certainty 
value. 

Finally, ORFs that were not predicted to be outer membrane or surface exposed, 
based on the above criteria, were further analyzed. The Blastp output data for these ORFs were 
20 searched using various general and specific keywords, suggestive of known cell surface exposed 
proteins. An ORF was labeled surface exposed if the keywords matched had a Blastp hit with a P 
score less than e" 10 , and there was no better data indicating otherwise. The following is a list of the 
searched keywords: 



25 


Adhesion 


Adhesm 


Invasin 




Invasion 


Extension 


Omp 




Outer Surface 


Porin 


Outer Membrane 




Cell Surface 


Cell Wall 


Pilin 




Flagellar sheath 


Cir 


ChuA 


30 


CopB 


ExeD 


FadL 




FecA 


FepA 


FhuA 




FmdC 


FomA 






GspD 


HemR 


HgbA 




Hgp 


HmbR 


HmuR 


35 


HMW 


HrcC 


Hrp 
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InvG 




LbpA 


LcrQ 




MxiD 


MOMP 


PilE 


HpaA 


NolW 




OpcP 


5 OpnP 


o^ A 


Por PA 


PhoE 


PldA 




PscC 


PulD 


PupA 


QuiX 


RafY 


ScrY 


SepC 


ShuA 


SomA 


10 SpiA 


Tbpl 


Yop 


YscC 


mip 


Tol 


Pilus 


BtuB 





Those ORFs that did not meet the minimum requirement for being an outer membrane protein based 
on the above search criteria but which were homologous to identified outer membrane ORFs in 

15 Chlamydia pneumoniae were included. The Chlamydia pneumoniae genome (French patent 
application No. 97-14673, filed 21 November 1997) was analyzed using the above search criteria and 
a number of outer membrane ORFs were identified. These Chlamydia pneumoniae ORFs were then 
tested against the Chlamydia trachomatis genome using Blastp. Any Chlamydia trachomatis ORF 
with a Blastp P value less than e" 10 against a Chlamydia pneumoniae outer membrane was included in 

20 this section, if there was no better data indicating otherwise. A list of ORFs in the Chlamydia 
trachomatis genome encoding putative surface exposed proteins is set forth above in the specification. 

Identification of Putative Lipoproteins in the Genome of Chlamydia trachomatis 
Lipoproteins are the most abundant post-translationally modified bacterial secretory 

25 proteins (Pugsley, A. P.. 1993. The complete general secretory pathway in Gram-negative bacteria. 
Microbiol. Rev. 57:50-108). The characteristic features of lipoproteins are a thiol-linked 
diacylglyceride and an amine-linked monoacyl group on the cysteine that becomes the amino-terminal 
residue after signal peptide cleavage by Signal Peptidase E. (Pugsley, A. P.. 1993. The complete 
general secretory pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108). The identification 

30 of putative lipoproteins from the genomic sequencing of Chlamydia trachomatis was done by 
examining the deduced amino acid sequence of identified ORFs for the presence of a signal peptide 
with a Signal Peptidase II cleavage site analogous to the consensus sequence for prolipoprotein 
modification and processing reactions (Hayashi, S., and H. C. Wu. 1992. Identification and 
characterization of lipid-modified proteins in bacteria, p. 261-285. In N. M. Hooper and A. J. Turner 

35 (ed.) Lipid modification of proteins: A practical approach. Oxford University Press, New York; 
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Sutcliffe, I. C. and R. R. B. Russell. 1995. Lipoproteins of Gram-positive bacteria. J. Bacteriol. 
177:1123-1128). 

The deduced amino acid sequences of Chlamydia trachomatis ORFs were initially screened 
for the most basic of lipoprotein characteristics, a cysteine in the first 30 amino acids of the deduced 
protein. ORFs with a standard start codon (ATG, GTG, or TTG) and having one or more of the 
following characteristics were selected for direct analysis of their first 30 amino acids: 

(a) Significant Signal P value (at least two out-of-the four values are Yes) 

(b) PSORT value indicating membrane passage (EVl-inner membrane, Peri-periplasm, or 
OM-outer membrane) 

(c) Identification of the word lipoprotein among the ORF Blastp data set. 

(d) A Blastp value of <e" 10 with a putative lipoprotein from Chlamydia pneumoniae 
(French application No. 97-14673 filed 21 November 1997). 

The first 30 amino acids encoded by each ORF in this set were analyzed for the 
characteristics commonly found in lipoprotein signal peptides (Pugsley, A. P.. 1993. The complete 
general secretory pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108; Hayashi, S., and H. 
C. Wu. 1992. Identification and characterization of lipid-modified proteins in bacteria, p. 261-285. In 
N. M. Hooper and A. J. Turner (ed.) Lipid modification of proteins: A practical approach. Oxford 
University Press, New York; Sutcliffe, I. C. and R. R. B. Russell. 1995. Lipoproteins of Gram- 
positive bacteria. J. Bacteriol. 177:1123-1128.) Putative lipoprotein signal peptides were required to 
have a cysteine between amino acid 10 and 30 and reach a minimum score of three based on the 
following criteria for lipoprotein signal peptides: 

(a) Identification of specific amino acids in specific positions around the cysteine which are 
part of the consensus Signal Peptidase II cleavage site (Hayashi, S., and H. C. Wu. 1992. 
Identification and characterization of lipid-modified proteins in bacteria, p. 261-285. In 
N. M. Hooper and A. J. Turner (ed.) Lipid modification of proteins: A practical approach. 
Oxford University Press, New York); Sutcliffe, I. C. and R. R. B. Russell. 1995. 
Lipoproteins of Gram-positive bacteria. J. Bacteriol. 177:1123-1128). Since the 
identification of the cleavage site is the most important factor in identifying putative 
lipoproteins, each correctly positioned amino acid contributed toward reaching the 
minimum score of three. 

(b) A hydrophobic region rich in alanine and leucine prior to the cleavage site (Pugsley, A. 
P.. 1993. The complete general secretory pathway in Gram-negative bacteria. Microbiol. 
Rev. 57:50-108) contributed toward reaching the minimum score of three. 

(c) A short stretch of hydrophilic amino acids greater than or equal to 1 , usually lysine or 
arginine, following the N-terminal methionine (Pugsley, A. P.. 1993. The complete general secretory 
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pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108) contributed toward reaching the 
minimum score of three. 

A list of ORFs in the Chlamydia trachomatis genome encoding putative lipoproteins is set forth above 
in the specification. 

5 

LPS-Related ORFs of Chlamydia trachomatis 

Lipopolysaccharide (LPS) is an important major surface antigen of Chlamydia cells. 
Monoclonal antibodies (Mab) directed against LPS of Chlamydia pneumoniae have been identified 
that can neutralize the infectivity of Chlamydia pneumoniae both in vitro and in vivo (Peterson et al. 

10 1988). Similar results are expected utilizing monoclonal antibodies against LPS of Chlamydia 
trachomatis. LPS is composed of lipid A and a core oligosaccharide portion and is phenotypically of 
the rough type (R-LPS) (Lukacova, M., Baumann, M., Brade, L„ Mamat, U., Brade, H. 1994. 
Lipopolysaccharide Smooth-Rough Phase Variation in Bacteria of the Genus Chlamydia. Infect. 
Immun. June 62(6):2270-2276.) The lipid A component is composed of fatty acids which serve to 

15 anchor LPS in the outer membrane. The core component contains sugars and sugar derivatives such 
as a trisaccharide of 3-deoxy-D-manno-octulosonic acid (KDO) (Reeves, P.R., Hobbs, M., Valvano, 
M.A., Skurnik, M., Whitfield, C, Coplin, D., Kido, N., Klena, J., Maskell, D., Raetz, C.R.H., Rick, 
P.D. 1996. Bacterial Polysaccharide Synthesis and Gene Nomenclature pp. 10071-10078, Elsevier 
Science Ltd.). The KDO gene product is a multifunctional glycosyltransferase and represents a 

20 shared epitope among the Chlamydia. For a review of LPS biosynthesis see, e.g., Schnaitman, C.A., 
Klena, J.D. 1993. Genetics of Lipopolysaccharide Biosynthesis in Enteric Bacteria. Microbiol. Rev. 
57:655-682. 

A text search of the ORF Blastp results identified several genes that are involved in 
Chlamydial LPS production with a P score less than e' 10 . The following key-terms were used in the 
25 text search: KDO, CPS (Capsular Polysaccharide Biosynthesis), capsule, LPS, rfa, rfb, rfc, rfe, rha, 
rhl, core, epimerase, isomerase, transferase, pyrophosphorylase, phosphatase, aldolase, heptose, 
manno, glucose, lpxB, fibronectin, fibrinogen, fucosyltransferase , lie, lgt, pgm, tolC, rol, ChoP, 
phosphorylcholine, waaF, PGL-Tbl. A list of ORFs in the Chlamydia trachomatis genome encoding 
putative polypeptides involved in LPS biosynthesis is set forth above in the specification. 

30 

Type ITl And Other Secreted Products 

Type m secretion enables gram-negative bacteria to secrete and inject pathogenicity 
proteins into the cytosol of eukaryotic host cells (Hueck, C. J„ 1998. Type EI Protein Secretion 
Systems in Bacterial Pathogens of Animals and Plants. In Microbiology and Molecular Biology 
35 Reviews. 62:379-433.) These secreted factors often resemble eukaryotic signal transduction factors, 
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thus enabling the bacterium to redirect host cell functions (Lee, C.A., 1997. Type III secretion 
systems: machines to deliver bacterial proteins into eukaryotic cells? Trends Microbiol. 5:148-156.) 
In an attempt to corrupt normal cellular functions, Chlamydial pathogenicity factors injected into the 
host cytosol will nonetheless, as cytoplasmic constituents be processed and presented in the context of 
5 the Major Histocompatibility Complex (MHC class I). As such, these pathogenicity proteins 
represent MHC class I antigens and will play an important role in cellular immunity. Also included in 
this set are secreted non-type HI products that may play a role as vaccine components. 

A text search of the ORF Blastp results identified genes that are involved in 
Chlamydia trachomatis protein secretion with a P score less than e" 10 . The following key-terms were 

10 used in the text search in an effort to identify surface localized or secreted products: Yop, Lcr, Ypk, 
Exo, Per, Pop, Ipa, Vir, Ssp, Spt, Esp, Tir, Hrp, Mxi, hemolysin, toxin, IgA protease, cytolysin, tox, 
hap, secreted and Mip. 

Chlamydia trachomatis ORFs that did not meet the above keyword search criteria, 
but have homologs in Chlamydia pneumoniae that do meet the search criteria are included herein. 

15 The Chlamydia pneumoniae genome (French patent application No. 97-14673, filed 21 November 
1997) was analyzed using the above search criteria and a number of ORFs were identified. These 
Chlamydia pneumoniae ORFs were tested against the Chlamydia trachomatis genome using Blastp. 
Any Chlamydia trachomatis ORF with a Blastp P value < e" 10 against a Chlamydia pneumoniae 
homolog, identified using the above search criteria, was included. A list of ORFs in the Chlamydia 

20 trachomatis genome encoding putative secreted proteins is set forth above in the specification. 

Chlamydia trachomatis RGD Recognition Sequence 

Proteins that contain Arg-Gly-Asp (RGD) attachment site, together with integrins that 
serve as their receptor constitute a major recognition system for cell adhesion. The RGD sequence is 

25 the cell attachment site of a large number of adhesive extracellular matrix, blood, and cell surface 
proteins and nearly half of the known integrins recognize this sequence in their adhesion protein 
ligands. There are many RGD containing microbial proteins such as the penton protein of adenovirus, 
the coxsackie virus, the foot and mouth virus and pertactin, a 69 kDa (kilodalton) surface protein of 
Bordetella pertussis, that serve as ligands through which these microbes bind to integrins on the cell 

30 surfaces and gain entry into the cell. The following provides evidence supporting the importance of 
RGD in microbial adhesion: 

a) The adenovirus penton base protein has a cell rounding activity and when penton base was 
expressed in E. coli, it caused cell rounding and cells adhered to polystyrene wells coated with the 
protein. Mutant analysis showed that both these properties required an RGD sequence. Virus 

35 mutants with amino acid substitutions in the RGD sequence, showed much less adherence to HeLa S3 
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cells, and also were delayed in virus reproduction (Bai, M., Harfe, B., and Freimuth, P. 1993. 
Mutations That Alter an RGD Sequence in the Adenovirus Type 2 Penton Base Protein Abolish Its 
Cell-Rounding Activity and Delay Virus Reproduction in Flat Cells. J. Virol. 67:5198-5205). 

5 b) It has been shown that attachment and entry of coxsackie virus A9 to GMK cells were 

dependent on an RGD motif in the capsid protein VP1. VP1 has also been shown to bind 0^3 
integrin, which is a vitronectin receptor (Roivainen, M., Piirainen, L., Hovi, T., Virtanen, I., Riikonen, 
T., Heino, J., and Hyypia, T. 1994. Entry of Coxsackievirus A9 into Host Cells: Specific Interactions 
with ccvfb Integrin, the Vitronectin Receptor Virology, 203:357-65). 

10 

c) During the course of whooping cough, Bordetella pertussis interacts with alveolar 
macrophages and other leukocytes on the respiratory epithelium. Whole bacteria adheres by means 
of two proteins, filamentous hemagglutinin (FHA) and pertussis toxin. FHA interacts with two 
classes of molecules on macrophages, galactose containing glycoconjugates and the integrin CR3. 
15 The interaction between CR3 and FHA involves recognition of RGD sequence at the positions 1097- 
1099 in FHA (Relman, D., Tuomanen, E., Falkow, S., Golenbock, D. T., Saukkonen, K., and Wright, 
S. D. " Recognition of a Bacterial Adhesin by an Integrin: Macrophage CR3 Binds Filamentous 
Hemagglutinin of Bordetella Pertussis." Cell, 61:1375-1382 (1990)). 

20 d) Pertactin, a 69 kDa outer membrane protein of Bordetella pertussis, has been shown to 

promote attachment of Chinese hamster ovary cells (CHO). This attachment is mediated by 
recognition of RGD sequence in pertactin by integrins on CHO cells and can be inhibited by synthetic 
RGD containing peptide homologous to the one present in pertactin (Leininger, E., Roberts, M., 
Kenimer, J. G., Charles, I. G., Fairweather, N., Novotny, P., and Brennan, M. J. 1991. Pertactin, an 

25 Arg-Gly-Asp containing Bordetella pertussis surface protein that promotes adherence of mammalian 
cells Proc. Natl. Acad. Sci. USA, 88:345-349). 

e) The RGD sequence is highly conserved in the VP1 protein of foot and mouth disease virus 
(FMDV). Attachment of FMDV to baby hamster kidney cells (BHK) has been shown to be mediated 
30 by VP1 protein via the RGD sequence. Antibodies against the RGD sequence of VP1 blocked 
attachment of virus to BHK cells (Fox, G., Parry, N. R., Barnett, P. V., McGinn, B., Rowland, D. J., 
and Brown, F. 1989. The Cell Attachment Site on Foot-and-Mouth Disease Virus Includes the Amino 
Acid Sequence RGD (Arginine-Glycine-Aspartic Acid) J. Gen. Virol., 70:625-637). 

It has been demonstrated that bacterial adherence can be based on interaction of a 
35 bacterial adhesin RGD sequence with an integrin and that bacterial adhesins can have multiple 
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binding site characteristic of eukaryotic extracellular matrix proteins. RGD recognition is one of the 
important mechanisms used by microbes to gain entry into eukaryotic cells. 

The complete deduced protein sequence of the Chlamydia trachomatis genome was 
searched for the presence of RGD sequence. There were a total of 38 ORFs that had one or more 
5 RGD sequences. Not all RGD containing proteins mediate cell attachment. It has been shown that 
RGD containing peptides that have proline immediately following the RGD sequence are inactive in 
cell attachment assays (Pierschbacher & Ruoslahti. 1987. Influence of stereochemistry of the 
sequence Arg-Gly-Asp-Xaa on binding specificity in cell adhesion. J. Biol. Chem. 262:17294-98). 
ORFs that had RGD, with proline as the amino acid following the RGD sequence were excluded from 
10 the list. Also, RGD sequence may not be available at the surface of the protein or may be present in a 
context that is not compatible with integrin binding. Since not all RGD-containing proteins are 
involved in cell attachment, several other criteria were used to refine the list of RGD-containing 
proteins. A list of ORFs in the Chlamydia trachomatis genome encoding polypeptides with RGD 
recognition sequence(s) is set forth above in the specification. 

15 

Non-Chlamydia pneumoniae ORFs 

Chlamydia trachomatis ORFs were compared to the ORFs in the Chlamydia 
pneumoniae genome (French patent application No. 97-14673, filed 21 November 1997) using Blastp. 
Any Chlamydia trachomatis ORF with a Blastp «P» value greater than e' 10 (i.e. >e" 10 ) against 
20 Chlamydia pneumoniae ORFs are included in this section. A list of ORFs in the Chlamydia 
trachomatis genome which are not found in Chlamydia pneumoniae is set forth above in the 
specification. 

Cell Wall Anchor Surface ORFs 

25 Many surface proteins are anchored to the cell wall of Gram-positive bacteria via the 

conserved LPXTG motif (Schneewind, O., Fowler, A., and Faull, K.F. 1995. Structure of the Cell 
Wall Anchor of Surface Proteins in Staphylococcus aureus. Science 268:103-106). A search of the 
proteins encoded by the Chlamydia trachomatis ORFs was done using the motif LPXTG. A list of 
ORFs in the Chlamydia trachomatis genome encoding polypeptides anchored to the cell wall is set 

30 forth above in the specification. 

ECACC Deposits 

Samples of Chlamydia trachomatis were deposited with the European Collection of 
Cell Cultures (ECACC), Salisbury, Wiltshire SP4 OJG, UK on November 26, 1998 and assigned the 
35 provisional accession number 98112618. Cells can be grown, harvested and purified, and DNA can 
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be prepared as discussed above. In order to enable recovery of specific fragments of the chromosome, 
one can run targeted PCR reactions, whose amplification products can then be sequenced and/or 
cloned into any suitable vector, according to standard procedures known to those skilled in the art. 

In addition, a pool of clones covering the Chlamydia trachomatis genome was 
5 deposited with the ECACC on November 26, 1998 and assigned provisional accession number 
98112617. The pool of clones contains a series of clones, which when taken together, cover the 
whole chromosome, with a redundancy of slightly more than ten. The total number of clones in the 
sample is 13,572. 

Table 4 lists groups of oligonucleotides to be used to amplify each of ORFs 2-1197 

10 according to standard procedures known to those skilled in the art. Such oligonucleotides are listed as 
SEQ ID Nos. 1198 to 5981. For each ORF, the following is listed: one forward primer positioned 
2,000 bp upstream of the beginning of the ORF; one forward primer positioned 200 bp upstream of 
the beginning of the ORF; one reverse primer positioned 2,000 bp downstream at the end of ORF, 
which is 2,000 bp upstream of the end site of the ORF on the complementary strand; and one reverse 

15 primer 200 bp downstream at the end of ORF, which is 200 bp upstream of the end site of the ORF on 
the complementary strand. The corresponding SEQ ID Nos. for the primers are listed in Table 4, 
where Fp is the proximal forward primer; Fd is the distal forward primer; Bp is the proximal reverse 
primer; and Bd is the distal reverse primer. The positions of the 5' ends of each of these primers on 
the nucleotide sequence of SEQ ID No. 1 are shown in Table 5. 

20 The present invention is not to be limited in scope by the specific embodiments 

described herein, which are intended as single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the scope of the invention. Indeed, 
various modifications of the invention, in addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing description and accompanying drawings. Such 

25 modifications are intended to fall within the scope of the appended claims. 

All publications and patent applications mentioned in this specification are herein 
incorporated by reference to the same extent as if each individual publication or patent application 
was specifically and individually indicated to be incorporated by reference. 
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TABLE 2 



SEQ ID NO 


begin 


stop 


preferred start 


2 


501 


208 


501 


3 


3276 


505 


3153 


4 


5068 


3242 


5062 


5 


6400 


5126 


6400 


6 


7977 


6619 


7977 


7 


8582 


8082 


8582 


8 


8995 


8591 


8995 


9 


9440 


8979 


9440 


10 


9828 


10430 


9828 


11 


10367 


11254 


10430 


12 


11245 


11916 


11245 


13 


12068 


13324 


12068 


14 


13532 


14413 


13538 


15 


14807 


15019 


14807 


16 


14932 


15969 


14977 


17 


15995 


16501 


16004 


18 


16467 


16138 


16377 


19 


18190 


17417 


18178 


20 


20521 


18437 


20518 


21 


22202 


20814 


22166 


22 


22602 


22153 


22509 


23 


22804 


22478 


22795 


24 


23183 


22824 


23180 


25 


23394 


23110 


23394 


26 


24569 


23394 


24569 


27 


26383 


24641 


26383 


28 


26640 


27710 


26640 


29 


28780 


27725 


28729 


30 


29957 


28740 


29957 


31 


30721 


30032 


30628 


32 


31281 


30520 


31254 


33 


31436 


31780 


31436 
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SEQ ID NO 


begin 


stop 


preferred start 


34 


33356 


31800 


33344 


35 


33901 


33314 


33874 


36 


34116 


35027 


34116 


37 


34988 


35359 


35027 


38 


35167 


35919 


35377 


39 


35923 


36996 


36031 


40 


37810 


37013 


37765 


41 


38207 


39085 


38252 


42 


39151 


39927 


39157 


43 


39923 


40756 


39959 


44 


40760 


42007 


40772 


45 


42175 


43116 


42229 


46 


42999 


43802 


43128 


47 


44211 


45227 


44217 


48 


46072 


45275 


46066 


49 


46340 


45975 


46331 


50 


46895 


46506 


46865 


51 


47955 


46882 


47955 


52 


48585 


48178 


48558 


53 


50072 


48630 


50012 


54 


50710 


50099 


50692 


55 


52439 


50925 


52430 




53484 


52348 


53478 


57 


54536 


53466 


54536 


58 


55086 


54595 


55104 


59 


56350 


55031 


56350 


60 


55659 


56084 


55722 


61 


56847 


58235 


56931 


62 


58423 


59181 


58423 




59185 


60195 


59194 


64 






60191 


65 


61496 


62353 


61496 


66 


62500 


63141 


62518 


67 


63396 


63983 


63396 
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SEQ ID NO 


h — ~L_ 


stop j preferred start 






64071 


64580 


69 


64285 


64656 


64285 


70 




64609 


64938 


71 




67269 


65347 


72 


67656 


68873 


67815 


73 




69233 


68892 


74 




69721 


69323 


75 




70455 


69970 


76 




71006 


70725 


77 




71086 


73185 


78 




73497 


74891 


79 




74876 


75463 


80 




75502 


77124 


81 


77000 


77299 


77012 


82 




77145 


78095 


83 




78154 


79065 


84 




79878 


81965 


85 


82639 


83271 


82642 


86 




84850 


83921 


87 


84876 


86921 


84888 


88 




87313 


88383 


89 




87805 


87458 


90 


88400 


88747 


88409 


91 


88717 


89265 


88729 


92 


55 


89732 


89355 


93 




91447 


89735 


94 




91435 


91749 


95 




91745 


92323 






92344 


92874 


97 






93945 


98 


94637 


94071 


94577 


99 


98299 


94628 


98113 


100 


98715 


98113 


98715 


101 


100228 


98741 


100195 
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bo,? ID ftU 


begin 


stop 


preferred start 




10134, 


100337 


101323 




102210 


101323 


102210 


104 


102485 


102210 


102479 


105 


104315 


102726 


104315 




105075 


1 04254 


105075 






105894 


105271 




107429 


108460 


107486 




108665 


108955 


108683 




iuy4>y 


109013 


109456 




110366 


109704 


110363 




1 1 1330 


112520 


111330 




112913 


1 13463 


112918 




113566 


1 13994 


113566 




114020 


1 14604 


114020 




114720 


1 15253 


114807 




115362 


1 15676 


115380 




116022 


1 19795 


116040 




119823 


124010 


119823 




124065 


124988 


124065 




124873 


125106 


124873 






125536 


126243 






126930 


126331 




127138 


127785 


127147 




127924 


129714 


127942 




129720 


131033 


129720 




131018 


131629 


131021 




131834 


133156 


131852 




133075 


133584 


133096 




133625 


133999 


133628 






134508 


133948 


132 


134638 






133 


137442 


140276 


137472 


134 


140733 


140335 


140727 


135 


141799 


141077 


141799 
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SEQ ID NO 


begin 


j stop 


jsreferred start 


136 


143240 


141780 


143240 




143829 


143128 


143820 




143923 


144393 


143923 


139 


144548 


146326 


144548 




146413 


147078 


146425 




147140 


148075 


147152 


142 


148115 


148549 


148115 


143 


148524 


149027 


148524 




149000 


149305 


149033 




149187 


149708 


149187 


146 


149712 


150911 


149769 




152044 


151004 


151966 




152664 


151999 


152592 




152900 


153352 


152924 


150 


153389 


153997 


153425 


151 


155276 


153984 


155228 




156544 


155231 


156544 




156806 


157525 


156809 




157489 


158955 


157534 




159104 


159961 


159104 




159916 


161220 


159916 




161183 


161593 


161228 




162354 


161623 


162354 




163013 


162363 


163013 




163941 


1 62994 


163941 




165505 


1 64474 


165505 


162 


166686 


1 66093 


166686 


163 


168171 


166729 


168171 


164 


169249 


168848 


169189 




169586 


170431 


169607 


166 


170780 




170783 


167 


171333 


172376 


171390 


168 


172309 


172722 


172309 


169 


173048 


174496 


173048 
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SEQ ID NO 


begin 


stop 


preferred stssrt 


170 


174399 


1 74968 


174399 


171 


175267 


175710 


175267 


172 


175714 


1 77009 


175735 


173 


177423 


178115 


177468 


174 


178084 


180021 


178084 


175 


1 80704 


1 80048 


180635 


176 


181398 


180631 


181398 


177 


182594 


181398 


1 82594 


178 


182895 


183656 


182895 


179 


183665 


184786 


183665 




1 86007 


184796 


186007 




186848 


186000 


186791 




187270 


186749 


1 87240 


183 


187426 


187809 


1 87429 


184 


189481 


188798 


189442 


185 


189693 


190352 


189693 


186 


190235 


190510 


190280 


187 


190785 


191786 


190824 






1 92464 


191811 




192392 


193183 


192500 




193254 


194630 


193263 


191 


195046 


194690 


195037 


192 


195184 


1 9703 1 


195193 


193 


197018 


197635 


197024 




197762 


198208 


197669 




198963 


197668 


198954 


196 


199957 


198962 


199945 


197 


200327 


199941 


200306 


198 


200685 


200266 


200598 




200962 


200585 


200962 


200 








201 


203441 


202380 


203441 


202 


203998 


203471 


203989 


203 


206449 


204059 


206434 
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oca jrs \ir\ 


begin 


stop 


preferred start 




207425 


20681 1 


207410 




207506 


208528 


207506 




208545 


209471 


208545 




209471 


210214 


209471 




210586 


210816 


210586 




211332 


210883 


211293 


210 


212978 


211374 


212972 




214134 


212875 


214134 




214710 


214168 


214701 




215143 


214754 


215128 




216705 


215236 


216705 




217917 


216892 


217911 


216 


217088 


217441 


217202 


217 


218364 


218702 


218364 


218 


218695 


219009 


218785 




219179 


219748 


219260 




219891 


220430 


219912 


221 


220499 


221074 


220505 


222 


221137 


221541 


221176 




221601 


222092 


221616 




222472 


223290 


222487 






223818 


223423 




224278 


225171 


224278 




225749 


225 1 74 


225749 




225334 


225549 


225328 


229 


226654 


225749 


226654 


230 


227299 


226769 


227170 




227646 


227161 


227646 


232 


228457 


227750 


228439 




230001 


228607 


229854 


234 


231074 






235 


231348 


233006 


231366 


236 


233059 


233829 


233059 


237 


233801 


234265 


233801 
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SEQ ID NO 




stop 


preferred start 


238 


234282 




234288 


239 


ZjOjUU 




236300 


240 


236314 


238209 




241 


238164 




2381 85 


242 


238769 






243 


242022 




242022 


244 


242846 




242846 


245 


244480 


242798 


244456 


246 


245897 


1 y 


245891 


247 


246877 






248 


247731 




247725 


249 


248585 




248573 


250 


249420 




249411 


251 


250383 




250383 


252 






251 174 


253 


2521 1 1 






254 


253088 




Z5J06O 


255 


255153 




255153 


256 


256762 






257 


25791 1 






258 


258780 




258840 


259 




261604 


259193 


260 


261622 




261 622 


261 


264125 




264134 


262 


26474 1 


265628 


264759 


263 


266416 




2664 1 6 


264 


266938 


266426 


267946 


265 


267961 




267946 


266 


268320 


268066 


268299 


267 




268205 


268510 


268 


270116 


268500 


270116 


269 


270892 


270095 


270856 


270 


271191 


271613 


271224 


271 


272219 


272932 


272243 
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begin 


stop | {deferred start 




272884 


273588 


273079 




274816 


273596 


274807 




274821 


275666 


274953 


275 




276103 


277689 






278816 


278298 






279013 


279870 




280777 


279767 


280762 






281295 


281576 




282104 


281787 


282086 




284335 


282794 


284320 




284460 


284795 


284550 




284817 


285674 


284844 




285637 


286137 


285670 




286357 


286677 


286399 




286681 


287898 


286852 




288127 


289227 


288358 




289744 


290679 


289744 




290828 


291535 


291206 






292230 


291514 




292326 


293048 


292350 






294853 


293525 






295010 


295684 






295692 


296294 




297238 


296243 


297199 




297791 


298735 


297791 




298905 


300458 


298920 




302152 


300527 


302131 




304917 


302071 


304872 




306157 


304973 


306142 




306494 




306461 


302 


306963 


306436 


306963 


303 


308773 


306977 


308758 


304 


309881 


309276 


309869 


305 


310720 


309872 


310711 
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CCA TF*> XT**** 


begin 


stop 


preferred start 




311570 


310716 


311570 




312451 


31 1972 


312439 




313435 


314364 


313462 




314340 


314738 


314409 




315526 


314741 


315445 




316507 


315665 


316507 




317284 


316529 


317284 




317592 


317338 


317592 




318470 


317499 


318416 




317599 


317874 


317599 




318947 


318477 


318887 




319342 


320142 


319342 




320544 


321497 


320682 




321485 


321937 


321497 




321901 


322362 


321943 




322301 


323140 


322325 




,23144 


324913 


323177 




325621 


324977 


325621 




326268 


325621 


326262 




326469 


327203 


326469 


326 




328150 


327302 




JZOOUj 


328204 


328602 




329066 


328734 


329066 




329663 


329292 


329648 




330666 


329608 


330663 




331161 


330670 


331161 




331731 


331177 


331731 




332404 


331721 


332404 




332779 


333021 


332779 






333589 


333149 


336 


334357 






337 


334089 


334361 


334089 


338 


335142 


334729 


335124 


339 


335195 


335602 


335234 
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begin 


stop 


preferred start 




,3^673 


335194 


335673 






335903 


336229 




337378 


336338 


337378 




339947 


337347 


339947 




340507 


341847 


340576 




341783 


342022 


341786 




342249 


342470 


342249 




342597 


343370 


342597 




343361 


344032 


343379 




343956 


344225 


343962 




344357 


345142 


344357 






345161 


345934 




347102 


346080 


347102 




347113 


347940 


347119 




330164 


348146 


350113 




350423 


351283 


350426 




352207 


3513 14 


352207 




352727 


352245 


352703 






353305 


353709 






353670 


354215 






354140 


354721 






356672 


354966 




356700 


357377 


356700 




357326 


358093 


357500 




358035 


360743 


358035 




360753 


361121 


360753 




361162 


361884 


361162 




361826 


362746 


361826 


368 


363859 


362816 


363859 




364116 


365195 


364116 


370 


365198 






371 


365479 


367320 


365614 


372 


367341 


368603 


367341 


373 


368644 


369081 


368644 



WO 99/28475 



167 



PCT/IB98/01939 



bbK^ ID iNl) 


begin 


stop 


preferred start 


374 


369088 


370251 


369088 


375 


370769 


371086 


370769 


376 


371203 


372816 


371209 


377 


373119 


373529 


373152 


378 


373614 


374204 


373776 


379 


374736 


374224 


374703 


380 


376391 


374703 


376382 




377062 


376748 


377038 




377853 


378737 


377871 




378626 


379048 


378710 


384 


379017 


379403 


379038 


385 


380009 


379641 


379967 




380187 


381470 


380187 




381473 


382567 


381473 


388 


382704 


383702 


382728 


389 


383945 


383655 


383921 




385217 


383949 


385211 




385507 


385178 


385507 




386845 


385706 


386842 




386127 


386627 


386232 






386872 


387351 




387823 


387338 


387823 




388250 


387816 


388106 


397 


389169 


388237 


389169 




389955 


389173 


390087 




390988 


389945 


390922 


400 


391514 


391810 


391514 




392410 


393996 


392413 


402 


394170 


395354 


394185 




395309 


395992 


395354 


404 






396529 


405 


397507 


396542 


397498 


406 


398753 


397401 


398747 


407 


399688 


398909 


399667 



WO 99/28475 



168 



PCT/IB98/01939 



SEQ ID NO 




stop j preferred start 


408 




399778 


400167 


409 


401224 


400034 


401209 


410 




402021 


401776 


411 


402]26 


403220 


402132 


412 


403348 


405180 


403354 


413 


403788 


403276 


403785 


414 


405165 


405920 


405165 


415 


407049 


405955 


407049 


416 




407056 


409764 


417 


410532 


411416 


410532 


418 


41 1707 


413410 


411722 


419 




412606 


413334 


420 


413404 


413952 


413449 


421 


413841 


415112 


413991 


422 


414379 


413978 


414220 


423 


416664 


415177 


416646 


424 


417456 


416740 


417456 


425 


418053 


417721 


418044 


426 


418603 


418031 


418582 


427 


419531 


418647 


419531 


428 


420 1 90 


419672 


420190 


429 


421 171 


420245 


421171 


430 


421988 


421518 


421988 


431 


422486 


423043 


422492 


432 


423226 


425079 


423295 


433 


426054 


425146 


426021 


434 




426245 


426967 


435 


427248 


427817 


427248 


436 




429886 


429623 


437 


430360 






438 


430637 


430323 


430628 


439 


430933 


431787 


430966 


440 


431658 


431987 


431688 


441 


432232 


434475 


432238 



WO 99/28475 



169 



PCT/IB98/01939 



SEQ ID NO 


- &egm 




pre ferre<I start 


442 






436269 


443 






436571 


444 






437595 




438262 


437894 


438256 


446 






439031 


447 


439339 


438986 


439339 


448 






439705 


449 


AA\C\A1 




441042 


450 




AA 1 Pl/IO 


44191 1 






441898 


442584 


452 


AAASf\S 


446388 


444505 


453 




446452 


448029 


454 




447932 


449575 


455 


450546 




450576 


456 


451623 




451401 


457 


452393 




452575 


458 






453174 


459 


453567 




453567 


460 


455430 




455403 


461 


456047 




456047 


462 


457384 




457384 


463 




458450 


457659 


464 


njoJuo 




45851 1 


465 


459839 




459839 


466 


461624 




461624 


467 




462621 


462151 


468 






463749 


469 


464048 




464063 






465848 


464760 


471 






467414 


472 


468891 


467419 


468891 


473 


469280 


468906 


469226 


474 


469349 


469675 


469406 


475 


471226 


469826 


471160 



WO 99/28475 



170 



PCT/IB98/01939 



i>h,\£ ID J\U 


begin 


stop 


j preferred start 




471624 


471106 


471609 




471954 


473267 


471954 






473695 


473252 




473982 


474527 


47405 1 




475198 


474602 


475195 




476527 


475613 


476509 




478640 


476517 


478640 




479084 


478665 


479078 




479723 


479088 


479720 




480012 


479668 


479898 




481466 


479895 


481412 




481732 


481496 


481732 




48 1 864 


483429 


481870 




483402 


484964 


483402 




484898 


487864 


484970 






485222 


485593 




488204 


489247 


488321 




488571 


488233 


488562 




489440 


490456 


489473 




492765 


490507 


492690 






492893 


492654 




493744 




493723 


498 




4y4o/j 


493875 






494869 


494609 




494835 


495365 


494955 




495174 


494872 


495174 




495687 


496634 


495732 




496295 


497176 


496523 




497703 


498515 


498222 






499239 


498301 


506 


499215 




499254 


507 


501710 


500790 


501710 


508 


502863 


501808 


502830 


509 


503675 


502692 


503645 



WO 99/28475 



171 



PCT/IB98/01939 



SEQ ID NO 




St ° ? 


preferred start 


510 






505002 


511 




506986 


505739 


512 


506999 






513 


508404 






514 


508291 


508590 


508297 


515 


508915 


508478 




516 


509600 






517 






511 147 


518 


51 1547 




511547 


519 


512382 




512385 


520 


514287 




514269 


521 


514789 




514792 


522 


514994 




515027 


523 


515553 




515553 


524 


515808 


516422 


515820 


525 


5 1 6476 




516605 


526 


517927 




517927 


527 


518096 


J 1 o JOU 


518114 


528 


518403 




518412 


529 


518923 




518923 


530 






519730 


531 


521986 




521971 


532 


522131 


521 886 


522125 


533 


523495 




523483 


534 


524591 




524510 


535 


524652 




524685 


536 


525731 




525752 


537 




526400 


525999 


538 




526735 


526361 


539 






528284 


540 


528861 


528292 


528828 


541 


529723 


529142 


529645 


542 


530166 


529624 


530166 


543 


530543 


530223 


530543 



WO 99/28475 



172 



PCT/IB98/01939 



SEQ ID NO 




-j — if 


preferred start 


544 


53 1375 




531363 


545 


532370 






546 


533849 




533849 


547 






534615 


548 




534878 


535915 


549 






539114 


550 


539731 




539731 


551 


540523 


540969 


540526 


552 


540906 




541002 


553 




541825 


543222 


554 


544133 




5441 15 


555 


544565 




544532 


556 


544762 




544747 


557 


546423 




546423 


558 


547480 






559 


546789 






560 


547901 




547826 


561 




j4 /yuu 


548634 


562 




549459 


548704 


563 


550385 






564 


551611 






565 


553041 






566 


554946 




J jWiO 


567 


556300 




556300 


568 






556524 


569 


558126 




558105 


570 


557810 




557810 


571 


559215 




559215 


572 






561349 


573 


562931 




562898 


574 


564083 


563121 


564083 


575 


563593 


563943 


563644 


576 


565379 


566953 


565379 


577 


567079 


567966 


567274 



WO 99/28475 



173 



PCT/IB98/01939 





begin 
368021 


stop 
570399 


preferred start 
568021 




571269 


572021 


571284 




572519 


572755 


572519 






572731 


573393 




572879 


573427 


573077 




574160 


573660 


574160 




574426 


574184 


574426 






574446 


574781 






574923 


575156 


587 




575057 


575458 






575469 


575735 






57802, 


576545 






578017 


578673 


591 




582104 


579012 


592 


582697 


582206 


582682 






582811 


583095 


594 






583484 


595 




58343 8 


583803 


596 




583827 


584399 


597 




584299 


584967 




585297 


585016 


585285 




585240 


586610 


585300 








586505 


601 




359408 


587786 


602 


589198 


589578 


589258 




590061 


589630 


589971 




590739 


591272 


590775 




592406 


592765 


592406 






592849 


593127 




J 9.3 91)0 


593121 


593894 


608 


594138 


595637 




609 


596122 


595640 


596053 


610 


596864 


596154 


596828 


611 


597731 


597282 


597689 



WO 99/28475 



174 



PCT/IB98/01939 





begin 


stop 


preferred- start 






ouusoy 


598551 




601876 


600734 


601864 






601910 


603520 


615 




603531 


603794 






603757 


604398 


617 


604549 


605610 


604549 


618 


ouooiy 




606619 






607493 


606867 




ouyuoo 


608031 


608972 






609296 


609652 




611 860 


610109 


611830 


623 




612927 


611815 




613597 


612938 


613444 


625 






613952 


626 




615244 


614441 


627 




615683 


615396 


628 




615864 


617624 






617510 


618361 


630 






619338 


631 


620416 




620401 


632 




620261 


619929 


633 




620420 


621154 


634 




621154 


621678 








622399 


636 






623421 


637 




623570 


624106 


638 




624073 


624918 




625346 


626665 


625367 




626514 


626900 


626652 




626954 


627853 


626984 


642 


627822 


628124 




643 


628715 


628146 


628715 


644 


628932 


629801 


628935 


645 


630406 


629804 


630298 



WO 99/28475 



175 



PCT/IB98/01939 





begin 


stop 


preferred start 




ojuyou 


630298 


630915 




631799 


630915 


631799 


648 


637488 


638084 


637488 


649 


638036 


640207 


638111 


650 


640221 


643472 


640236 


651 


640627 


640220 


640627 




643485 


644495 


643488 




64447 1 


645430 


644471 




645394 


645840 


645538 




045 e4U 


6471 1 1 


645840 




649676 


647109 


649616 




649970 


650344 


649970 




650418 


651722 


650433 




651686 


652171 


651770 






652908 


652516 




652799 


653593 


652892 




659884 


661851 


660136 




661740 


662282 


661851 




662286 


663074 


662289 




662951 


663730 


663074 






663745 


664194 






664255 


665619 




OOOUOJ 


665727 


666056 




666423 


665782 


666390 




666831 


6681 17 


667047 




668121 


668375 


668139 




668470 


668174 


668404 






668616 


669485 




o&yovz 


669485 


669892 




670780 


669998 


670765 


676 


671241 






677 


671182 


672447 


671260 


678 


672692 


673231 


672698 


679 


673204 


674562 


673204 



WO 99/28475 PCT/IB98/01939 



■ cph irv XTr\ 
otv^ UJ JNv 


begin 


stop 


preferred start 












675232 


674612 






676463 


675327 








677027 


683 




677700 


678422 








678708 






68U502 


679342 




680579 


681280 


680654 




681539 


682558 


681557 






683087 


682578 






684465 


683164 


690 






684639 


691 




ooozUi 


684839 




686197 


687204 


686203 


693 




688360 


687341 


694 


688432 


688193 


688426 


695 


689616 


688432 


689601 


696 




689631 


689939 


697 


690487 


689846 


690445 


698 


69071 7 




690717 


699 


691871 




691856 






692041 


693837 




694934 




694934 


702 




KyfVy^L 


697230 


703 






697958 


704 


698392 




698380 






700117 


698792 






700895 


700269 




700912 


702165 


700990 




702183 


703412 


702183 






705000 


703531 


710 


705011 


705604 




711 


706159 


705704 


706093 


712 


706521 


706138 


706488 


713 


708103 


706496 


707932 



WO 99/28475 



177 



PCT/IB98/01939 



SEQ ID NO 






preferred start 


714 


708398 


/UoU / o 


708392 




70861 0 




708610 


716 




708872 


710203 






710262 


71 1164 








71 1432 


719 


712767 


713438 


712773 


720 






714217 


721 


714632 




714617 


722 




714834 


715739 


723 




715558 


715854 






715921 


716886 


725 


718357 




718357 


726 






718590 


727 


719797 




719776 


728 


720273 




720147 


729 






720452 


730 


720613 




720613 


731 






721571 


732 


723248 




723239 


733 






724580 


734 


725763 




725760 


735 


726519 




726519 


736 


726819 




726801 


737 




726753 


727466 


738 




727469 


727984 


739 






728718 


740 


729346 




729334 


741 






732639 


742 






733246 


743 






734814 


744 


735644 


736504 




745 


736520 


737254 


736520 


746 


737254 


737787 


737254 


747 


737942 


738679 


738122 



WO 99/28475 



r 



178 



PCT/IB98/01939 



SEQ3DNG 


begin I stop 


preferred start 






739740 


738862 






740060 


741982 




742869 


742045 


742824 






742824 


743348 




744298 


743306 


744292 




744714 


744430 


744660 


754 


744985 


74461 1 


74493 1 


755 


745557 


744958 


745548 




746412 


745561 


746409 




746772 


746416 


746697 






746944 


748269 




748966 


748274 


748954 


760 


749426 


748965 


749411 






749433 


749681 




750029 


749721 


750020 






750007 


752307 




752913 


752503 


752901 


765 


754659 


753616 


754659 




755000 


756814 


755000 




756796 


758301 


756832 






758446 


758688 






759338 


759787 








760188 




760538 


7601 88 


760529 




760966 


761772 


760966 




761759 


762142 


761759 


774 


762267 


762983 


762267 


775 


764465 


763335 


764312 


776 


764857 


764438 


764821 




766068 




765972 


778 


766643 


766065 


766643 


779 


768091 


766934 


768091 


780 


768785 


768252 


768785 


781 


770092 


768791 


770062 



WO 99/28475 



1 



1 79 



PCT/IB98/01939 





begin 


stop 


preferred start 






770470 


770150 






770185 


77063 1 






770634 


770894 




772010 


771330 


772010 








772390 






773427 


774215 




776035 


774191 


776035 




776663 


777706 


776894 






776953 


777177 




779222 


777732 


779180 




779321 


781552 


779360 


793 




782442 


781351 


794 






782447 






/56002 


785697 




786580 


785546 


786580 








787729 


798 




78802 1 


787782 








790064 








790178 






792016 


790634 




793084 


792059 


793084 






794056 


793370 






794957 


794079 






795144 


795395 






796255 


795575 






797015 


7963 1 1 




796979 


797365 


796979 




797260 


797856 


797395 




797772 


798086 


797805 






797935 


798393 


812 


798925 


798416 




813 


799301 


799927 


799301 


814 


800892 


800029 


800892 


815 


801062 


802129 


801062 



WO 99/28475 



180 



PCT/IB98/01939 



SEQ ID ~HO 


begin 


stop 


preferred' start 


816 


802023 


802673 


802041 


817 


802851 


803246 


802920 


818 


803105 


804220 


803111 


819 


804307 


805356 


804331 


820 


805290 


806282 


805356 




806453 


808081 


806498 


822 


808026 


809009 


808098 


823 


810461 


809079 


810437 


824 


81 1605 


810328 


811590 


825 


81 1725 


812342 


811824 


826 


812329 


813522 


812398 


827 


813455 


813772 


813455 


828 


813732 


814334 


813780 




815213 


814314 


815207 


830 


814878 


814396 


814975 


831 


815733 


815428 


815733 


832 


816116 


817456 


816170 


833 


817608 


819320 


817608 




819324 


819713 


819342 




819704 


820402 


819713 






821061 


820453 




821043 


821537 


821043 


838 


821646 


822239 


821667 


839 


822182 


822931 


822221 


840 


824355 


823045 


824352 


841 


825894 


824359 


825891 


842 


826322 


825879 


826322 


843 


826340 


827026 


826340 


844 


827014 


827250 


827014 




827856 


827230 


827856 


846 


828007 




828025 


847 


829355 


830953 


829358 


848 


831119 


831748 


831140 


849 


832152 


831751 


832140 



WO 99/28475 



181 



PCT/IB98/01939 





begin 


stop 


preferred start 




832744 


832214 


832666 








833446 




833802 


833368 


833742 




834679 


833879 


834661 






834661 


835365 






835371 


835775 






835775 


836470 


857 


836602 




836617 


858 


837209 


838699 


837209 




5 JO /ou 




838760 


860 


839942 


840583 


839951 






841713 


84045 1 






842459 


841686 




842523 


843068 


842541 


864 




843031 


843447 


865 


843239 




843335 


866 




843802 


844077 


867 






848022 






8481 50 


850099 






850230 


851504 




853696 




853672 


871 




ojj /UU 


854809 


872 




854920 


855468 


873 






856240 






856233 


857006 




859439 


857451 


859430 




859946 


859587 


859916 


877 




860640 


859660 






860724 


861599 






861580 


862038 


880 


863540 


862098 




881 


863930 


863571 


863927 


882 


864697 


863996 


864688 


883 


864923 


866248 


864923 



WO 99/28475 



182 



PCT/IB98/01939 





begin 


stop 


preferred start 




866303 


866605 


866336 




866665 


867732 


866665 




867810 


869090 


867864 






869357 


869094 




869270 




869336 




871299 


872582 


871359 




872429 


o /Z56U 


872555 


891 


872773 


873915 


872773 






873360 


873668 




874028 


874438 


874067 






875386 


874796 


895 






875843 


896 


877872 


5/ /UUU 


877866 


897 




877876 


878157 




879098 


878172 


879098 




5 / 5653 




878886 


900 


879842 


879105 


879809 


901 


ooUoo J 


ScUUdz 


880885 


902 


88 1 863 


ooUooy 


881863 


903 


882904 


881948 


882901 






882901 


883761 




884296 


883661 


884296 




884996 




884984 




888777 


885166 


888771 


908 


890172 




890172 


909 






891146 








891427 


91 1 


893278 


891968 


893278 




893356 


893808 


893386 




893909 


893643 


893894 


914 


894276 






915 


894778 


894248 


894760 


916 


895892 


895050 


895874 


917 


895951 


896829 


895963 



WO 99/28475 



1. 



1 83 



PCT/IB98/01939 



cpn in mo 






preferred start 




900783 


897064 


yuu/ /4 




902032 


900791 




920 


902659 


903876 


902659 


921 


90373 1 




903731 








903860 






yut>4/4 


905725 




906493 




906493 






yo/uu] 


907306 


926 


908101 


908742 


90813 1 


927 


908721 


909194 


908724 


928 


909198 


909584 


909201 


929 


909583 


909951 


909670 




910081 




910090 


931 




01 AO/I /I 

y i uy44 


910636 


932 


9 1 0948 




910951 


933 






912399 


934 






912595 




913203 




913218 






914485 


913691 


937 




915136 


914522 


938 




915467 


915162 


939 






915629 


940 






916159 






917627 


916965 


942 




918304 


917612 






918655 


918323 








918682 






919829 


919542 


946 






919723 


947 


920184 


920840 




948 


920866 


921294 


920866 


949 


921272 


921514 


921272 


950 


921510 


921758 


921510 


951 


921778 


922143 


921778 



WO 99/28475 



184 



PCT/IB98/01939 



OCA TT\ VfH 


begin 


stop 


preferred start 




922159 


922491 


922159 


953 


922496 


923035 


922496 




923160 


923453 


923160 






924032 


923484 




Q1AHA Q 

?Z4U4o 




924057 




yZ444.5 


924937 


924443 






925364 


924933 




925390 


926760 


925390 




926819 


927184 


926819 




927209 


927604 


927209 






928155 


927577 




928 100 


928759 


928127 






930244 


929243 




930222 


930656 


930258 




930608 


931078 


930665 




931367 


931666 


931406 






931959 


931558 




932070 


932579 


932070 




932602 


933201 


932602 






933621 


933319 






933785 


933522 




934546 


933848 


934546 






934539 


936377 




93808 1 


936666 


938081 






939098 


938595 




939329 


940933 


939506 




941031 


942068 


941076 




942082 


944685 


942082 


980 


944634 


945287 


944673 






946294 


945287 


982 


946293 


946676 




983 


947105 


948454 


947132 


984 


948522 


949277 


948546 


985 


949277 


949594 


949277 



WO 99/28475 



185 



PCT/IB98/01939 





begia 


stop 


preferred start 




949849 


950676 


949888 






951330 


950701 


988 


951281 


951643 


951290 


989 


951788 


952798 


95 1 803 




953581 


954264 


953602 




954426 


955157 


954429 




955754 


957940 


955766 






959312 


957867 






961050 


959317 


995 


961562 


961053 


961562 




962575 


961487 


962545 




961979 


961584 


961979 




OA/10 1 /I 


962545 


964914 




A/C/1A/1 1 


965708 


964956 






966193 


966984 




yo/444 


968061 


967459 




968903 


968064 


968792 




970685 


969528 


970685 




971806 


971024 


971785 






972388 


973026 






973746 


974546 




975223 


974558 


975214 




976193 


975207 


976193 




976520 


976254 


9765 1 1 




976588 


976899 


976588 




976886 


977635 


976934 




977661 


977933 


977682 




977918 


978433 


977933 


1014 


978619 


978984 


978619 




978933 


97933 1 


978987 


1016 




979389 




1017 


979711 


980112 


979753 


1018 


982116 


981148 


982107 


1019 


982321 


983598 


982321 
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S.EQ ID NO 


begin 


stop 


preferred ".start 


1020 


984488 


983862 


984296 


1021 


985381 


984371 


985381 


1022 


986103 


985399 


986046 


1023 


986693 


986046 


986693 


1024 


987607 


986693 


987607 


1025 


9881 19 


987616 


987942 


1026 


988253 


987936 


988247 


1027 


988831 


989163 


988834 


1028 


989693 


993442 


989693 


1029 


993408 


993785 


993408 


1030 


993835 


993416 


993754 


1031 


993882 


994262 


993906 


1032 


994226 


995656 


994259 


1033 


996036 


996611 


996036 


1034 


996885 


998267 


996885 




998962 


999225 


998962 




yyyj /a 


1001033 


999393 




100121 1 


1001516 


1001214 




1001392 


1001664 


1001443 




1003721 


1001823 


1003721 


1040 


1004459 


1004845 


1004459 


1041 


1004990 


1005382 


1004990 




1005391 


1007496 


1005391 


1043 


1007486 


1007821 


1007453 




1 flATOm 

lUO/oOz 


1008698 


1007841 


1045 


1009426 


1009121 


1009426 


1046 


1010534 


1012054 


1010534 


1047 


1012397 


101 1942 


1012241 


1048 


1012042 


1012635 


1012057 




1012593 


1012862 


1012593 


1050 


101281 1 


1 01 'KAAO 


1012829 


1051 


1013456 


1014055 


1013468 


1052 


1013977 


1014489 


1013977 


1053 


1015224 


1014529 


1015206 



WO 99/28475 



1 



1 87 



PCT/IB98/01939 



i>bs£ LU i\U 


begin 


stop 


preferred start 




lUloOOz 


1015145 


1015963 


1055 


1017120 


1015939 


1017120 


1056 


1017766 


1017245 


1017658 






1017916 


1018893 




iuiyiy i 


1018580 


10191 10 






101983 1 


1020196 






10201 14 


1020992 




1 m 1 c/cn 


1021075 


1021557 


1062 


102241 1 


1022097 


1022402 


1063 


1023344 


1023667 


1023344 






1023949 


1023701 






1024776 


1023976 




1UZ4/U4 


1025045 


1 024704 




1 n">^cs 1 
luzjoo i 


1024967 


1025845 




1026546 


1025839 


1026546 


1069 


1027379 


1026546 


1027373 




1030604 


1027929 


1030328 




1033252 


1030508 


1033249 






1032086 


1031823 




1037037 


1033456 


1037016 




1035674 


1035910 


1035674 






1036507 


1036268 




Do(comp) 


1036967 


38 




16591 


16989 


16597 




31779 


31408 


31764 




56502 


56834 


56520 




56686 


56913 


56686 




64748 


65074 


64790 


1082 


73482 


73195 


73482 




78482 


78736 


78506 


1084 


79803 






1085 


82333 


81959 


82333 


1086 


87313 


86999 


87523 


1087 


109929 


109456 


109716 



WO 99/28475 



188 
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SEQ ID NO 




- St ° ? 


preferred start 


1088 






111599 


1089 








1090 


1 12666 






1091 








1092 


120757 




120757 


1093 






125133 


1094 


131888 




131837 


1095 


144164 




144191 


1096 


150698 




150635 


1097 


164385 




164385 


1098 


165690 




165408 


1099 


168742 






1100 


170509 


170793 




1101 


177145 






1102 


188295 


188023 


1 88295 


1103 


188791 


1 88330 




1104 


190629 


190336 




1 105 


197313 




197307 


1 106 


210914 




210956 


1 107 


235160 






1 108 


237227 




^37188 


1109 


249733 


249446 


249904 


1110 


253493 


253158 




1111 


253701 






1112 


271633 




271633 


1113 


275666 


276070 


275666 


1114 






277976 


1115 


282741 




282738 


1116 


293178 


293489 


293181 


1117 








1118 


309297 


308965 


309297 


1119 


312219 


312536 


312246 


1120 


312853 


312602 


312844 


1121 


313167 

1 


312772 


313167 



WO 99/28475 PCT/IB98/01939 



OCA TT~> XT/"\ 

Ml(^ ID JNU 


begin j stop 


preferred start 




320224 


320598 


320224 




340249 


340503 


340249 




352839 


353324 


352839 




373475 


373699 


373475 




377316 


377756 


377316 




379268 


379657 


379268 


1128 


395098 


394823 


395077 


1129 


401594 


401142 


401594 




410045 


410539 


410045 




41 1425 


411658 


411425 




414937 


414416 


414937 




422889 


423212 


422964 




427842 


428183 


427842 




428732 


42945 1 


428732 


1136 


442557 


442799 


442524 




443628 


44404 1 


443628 




443678 


443166 


443678 




445901 


446155 


445901 




467981 


468262 


468023 




471869 


472108 


471869 




488032 


488337 


488044 


1143 


497179 


497694 


497101 




500474 


500202 


500471 




508968 


509561 


508968 




510845 


511264 


510845 




526525 


526848 


526525 


1148 


531318 


531863 


531444 




556826 


557224 


556826 


1150 


564971 


564537 


564971 




566963 




566963 


1152 


570351 


570890 




1153 


571072 


571332 


571072 


1154 


576025 


575801 


576025 


1155 


590363 


590650 


590363 j 



WO 99/28475 



190 



PCT/IB98/01939 





begin 


stop 


preferred start 




597868 


598593 


597868 




ouooay 


606626 


606889 




60803 1 


607786 


60803 1 




610110 


610391 


610143 




632703 


633353 


632703 








637255 


1162 




o4yyz4 


650517 






652562 


652317 


1 164 




655325 


654753 






660810 


661118 






677057 


677578 




679528 


679253 


679477 






732210 


732536 




742069 




742069 


1 170 




758782 


759318 


1 171 


1 OUZoZ 




760282 


1 172 




770894 


771391 






772408 


772115 






788457 


788137 




816302 




816302 


1 176 


O400U0 


QA £01 /I 


846612 


1 177 


867803 


00oUj4 


867806 


1 178 




875658 


875395 




876445 


876915 


876445 






884312 


884548 




891859 


891467 


891859 




onn*7Tn 
yuu/ /u 


900417 


900728 






902269 


902529 




VUo(J4o 


907783 


908007 






912567 


912313 


1186 


935451 


935741 




1187 


946961 


946692 


946940 


1188 


953193 


952783 


953145 


1189 


966199 


965873 


966184 



WO 99/28475 



191 



PCT/IB98/01939 





begin 


stop 


preferred start 


1190 




968765 


969298 








971009 






y 1 Z4i)4 


972165 






973508 


9731 19 


1194 


998649 


998404 


998625 


1195 


1004280 


1003882 


1004280 


1196 


1010200 


1009532 


1010200 


1197 


1029174 


1029482 


1029180 
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192 
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TABLE 4 



QRFpeaset 


ORFoligosFd 


QKFo%osFp 


OKFoligosBd 


ORFoligosBp 


2 


1199 


1198 


3591 


3590 


3 


1201 


1200 


3593 


3592 


4 


1203 


1202 


3595 


3594 


5 


1205 


1204 


3597 


3596 


6 


1207 


1206 


3599 


3598 


7 


1209 


1208 


3601 


3600 


8 


1211 


1210 


3603 


3602 


9 


1213 


1212 


3605 


3604 


10 


1215 


1214 


3607 


3606 


11 


1217 


1216 


3609 


3608 


12 


1219 


1218 


3611 


3610 


13 


1221 


1220 


3613 


3612 


14 


1223 


1222 


3615 


3614 


15 


1225 


1224 


3617 


3616 


16 


1227 


1226 


3619 


3618 


17 


1229 


1228 


3621 


3620 


18 


1231 


1230 


3623 


3622 


19 


1233 


1232 


3625 


3624 


20 


1235 


1234 


3627 


3626 


21 


1237 


1236 


3629 


3628 


22 


1239 


1238 


3631 


3630 


23 


1241 


1240 


3633 


3632 


24 


1243 


1242 


3635 


3634 


25 


1245 


1244 


3637 


3636 


26 


1247 


1246 


3639 


3638 


27 


1249 


1248 


3641 


3640 






1250 


3643 


3642 


29 


1253 


1252 


3645 


3644 


30 


1255 


1254 


3647 


3646 


31 


1257 


1256 


3649 


3648 


32 


1259 


1258 


3651 


3650 


33 


1261 


1260 


3653 


3652 
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ORFoligosFp j ORFoligosBd 


ORFoligosBf 


34 




1262 


3655 


3654 


35 




1264 


3657 


3656 


36 




1266 


3659 


3658 


37 




1268 


3661 


3660 


38 




1270 


3663 


3662 


39 




1272 


3665 


3664 


40 




1274 


3667 


3666 


41 




1276 


3669 


3668 


42 




1278 


3671 


3670 




1281 


1280 


3673 


3672 


44 




1282 


3675 


3674 


45 




1284 


3677 


3676 


46 




1286 


3679 


3678 


47 




1288 


3681 


3680 


48 




1290 


3683 


3682 


49 




1292 


3685 


3684 


50 




1294 


3687 


3686 


51 


— 


1296 


3689 


3688 


52 




1298 


3691 


3690 


53 


1301 


1300 


3693 


3692 


54 




1302 


3695 


3694 


55 




1304 


3697 


3696 


56 




1306 


3699 


3698 






1308 


3701 


3700 


58 




1310 


3703 


3702 


59 




1312 


3705 


3704 


60 




1314 


3707 


3706 


61 




1316 


3709 


3708 






1318 


3711 


3710 


63 








3712 


64 


1323 


1322 


3715 




65 


1325 


1324 


3717 


3716 


66 


1327 


1326 


3719 


3718 


67 


1329 


1328 


3721 


3720 
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PC171B98/01939 





ORFoligosFd 


ORJFoiigosFp 


QRFoljgosBd 


ORFoligosBp 


68 




1330 


3723 


3722 


69 




1332 


3725 


3724 


70 






3727 


3726 


71 




1336 


3729 


3728 


72 






3731 


3730 


73 




1340 


3733 


3732 


74 




1342 


3735 


3734 


75 




1344 


3737 


3736 


76 




1346 


3739 


3738 


77 




1348 


3741 


3740 


78 




1350 


3743 


3742 


79 




1352 


3745 


3744 


80 


1355 


1354 


3747 


3746 


81 




1356 


3749 


3748 


82 




1358 


3751 


3750 


83 




1360 


3753 


3752 


84 






3755 


3754 


85 




1364 


3757 


3756 


86 




1366 


3759 


3758 


87 




1368 


3761 


3760 


88 




1370 


3763 


3762 


89 




1372 


3765 


3764 


90 






3767 


3766 


91 




1376 


3769 


3768 


92 




1378 


3771 


3770 


93 




1380 


3773 


3772 


94 




1382 


3775 


3774 


95 




1384 


3777 


3776 


96 




1386 


3779 


3778 


97 




1388 


3781 


3780 


98 


1391 


1390 


3783 




99 


1393 


1392 


3785 


3784 


100 


1395 


1394 


3787 


3786 


101 


1397 


1396 


3789 


3788 
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ORFoligosFd | ORFoKgosFp 


ORFoligosM 


ORFoligosBp 




1399 


1398 


3791 


3790 


103 


1401 


1400 


3793 


3792 




1403 


1402 


3795 


3794 




1405 


1404 


3797 


3796 


106 


1407 


1406 


3799 


3798 




1409 


1408 


3801 


3800 


108 


1411 


1410 


3803 


3802 




1413 


1412 


3805 


3804 




1415 


1414 


3807 


3806 




1417 


1416 


3809 


3808 




1419 


1418 


381 1 


3810 




1421 


1420 


3813 


3812 


1 14 


1423 


1422 


3815 


3814 


1 15 


1425 


1424 


3817 


3816 




1427 


1426 


3819 


3818 




1429 


1428 


3821 


3820 




1431 


1430 


3823 


3822 




1433 


1432 


3825 


3824 




1435 


1434 


3827 


3826 




1437 


1436 


3829 


3828 




1439 


1438 


3831 


3830 




1441 


1440 


3833 


3832 




1443 


1442 


3835 


3834 




1445 


1444 


3837 


3836 


126 


1447 


1446 


3839 


3838 




1449 


1448 


3841 


3840 




1451 


1450 


3843 


3842 


129 


1453 


1452 


3845 


3844 




1455 


1454 


3847 


3846 








3849 


3848 


132 


1459 


1458 






133 


1461 


1460 


3853 


3852 


134 


1463 


1462 


3855 


3854 


135 


1465 


1464 


3857 


3856 
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wivrvJCfLwci 




ORFoligosFp | ORFoligosBd 


OJRFoHgosBp 


136 




1466 


3859 


3858 


137 


1469 


1468 


3861 


3860 


138 


1471 


1470 


3863 


3862 


139 




1472 


3865 


3864 


140 




1474 


3867 


3866 


141 


1477 


1476 


3869 


3868 


142 


1479 


1478 


3871 


3870 


143 




1480 


3873 


3872 


144 


1483 


1482 


3875 


3874 


145 
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4771 


4770 


593 


2381 




4773 


4772 


594 


2383 




4775 


4774 


595 


2385 




4777 


4776 


596 






4779 


4778 


597 


2389 




4781 


4780 


598 






4783 


4782 


599 


2393 




4785 


4784 


600 


2395 




4787 


4786 


601 






4789 


4788 


602 






4791 


4790 


603 






4793 


4792 


604 




2402 


4795 


4794 


605 






4797 


4796 


606 






4799 


4798 


607 






4801 


4800 


608 


2411 


2410 


4803 


4802 


609 


2413 


2412 


4805 


4804 


610 


2415 


2414 


4807 


4806 


611 J 2417 


2416 


4809 


4808 
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QRFoligosBd 


ORFoligosBp 




2419 


2418 


4811 


4810 




2421 


2420 


4813 


4812 






2422 


4815 


4814 


615 




2424 


4817 


4816 






2426 


4819 


4818 


617 




2428 


4821 


4820 




2431 


2430 


4823 


4822 




2433 


2432 


4825 


4824 




2435 


2434 


4827 


4826 




2437 


2436 


4829 


4828 




2439 


2438 


4831 


4830 


623 


2441 


2440 


4833 


4832 




2443 


2442 


4835 


4834 




2445 


2444 


4837 


4836 




2447 


2446 


4839 


4838 






2448 


4841 


4840 


628 




2450 


4843 


4842 


629 




2452 


4845 


4844 


630 




2454 


4847 


4846 


631 




2456 


4849 


4848 




2459 


2458 


4851 


4850 






2460 


4853 


4852 






2462 


4855 


4854 




2465 


2464 


4857 


4856 




2467 


2466 


4859 


4858 




2469 


2468 


4861 


4860 




2471 


2470 


4863 


4862 




2473 


2472 


4865 


4864 




2475 


2474 


4867 


4866 






2476 


4869 


4868 


642 


2479 


2478 






643 


2481 


2480 


4873 


4872 


644 


2483 


2482 


4875 


4874 


645 


2485 


2484 


4877 


4876 
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QRFoHgosBdf 
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2487 


2486 


4879 


4878 




2489 


2488 


4881 


4880 




2491 


2490 


4883 


4882 




2493 


2492 


4885 


4884 




2495 


2494 


4887 


4886 


651 


2497 


2496 


4889 


4888 


652 


2499 


2498 


4891 


4890 




2501 


2500 


4893 


4892 




2503 


2502 


4895 


4894 




2505 


2504 


4897 


4896 


656 


2507 


2506 


4899 


4898 


657 


2509 


2508 


4901 


4900 


658 


2511 


2510 


4903 


4902 


659 


2513 


2512 


4905 


4904 


660 


2515 


2514 


4907 


4906 




2517 


2516 


4909 


4908 


662 


2519 


2518 


4911 


4910 


663 


2521 


2520 


4913 


4912 


664 


2523 


2522 


4915 


4914 


665 


2525 


2524 


4917 


4916 


666 


2527 


2526 


4919 


4918 


667 


2529 


2528 


4921 


4920 




2531 


2530 


4923 


4922 


669 


2533 


2532 


4925 


4924 




2535 


2534 


4927 


4926 


671 


2537 


2536 


4929 


4928 


672 


2539 


2538 


4931 


4930 


673 


2541 


2540 


4933 


4932 


674 


2543 


2542 


4935 


4934 


675 






4937 


4936 


676 


2547 


2546 






677 


2549 


2548 


4941 


4940 


678 


2551 


2550 


4943 


4942 


679 


2553 


2552 


4945 


4944 



WO 99/28475 



212 



PCT/IB98/01939 





QRFoligosFd 


: ORFoligosFp 
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2555 


2554 


4947 


4946 




2557 


2556 


4949 


4948 




2559 


2558 


4951 


4950 




2561 


2560 


4953 


4952 




2563 


2562 


4955 


4954 




2565 


2564 


4957 


4956 




2567 


2566 


4959 


4958 




2569 


2568 


4961 


4960 




2571 


2570 


4963 


4962 




2573 


2572 


4965 


4964 




2575 


2574 


4967 


4966 


691 


2577 


2576 


4969 


4968 


692 


2579 


2578 


4971 


4970 




2581 


2580 


4973 


4972 




2583 


2582 


4975 


4974 






2584 


4977 


4976 




2587 


2586 


4979 


4978 




2589 


2588 


4981 


4980 




2591 


2590 


4983 


4982 




2593 


2592 


4985 


4984 




2595 


2594 


4987 


4986 






2596 


4989 


4988 




2599 


2598 


4991 


4990 




2601 


2600 


4993 


4992 




2603 


2602 


4995 


4994 


705 


2605 


2604 


4997 


4996 




2607 


2606 


4999 


4998 




2609 


2608 


5001 


5000 




2611 


2610 


5003 


5002 




2613 


2612 


5005 


5004 


710 


2615 


2614 






711 


2617 


2616 


5009 


5008 


712 


2619 


2618 


5011 


5010 


713 


2621 


2620 


5013 


5012 
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714 


2623 


2622 


5015 


5014 


715 


2625 


2624 


5017 


5016 


716 


2627 


2626 


5019 


5018 


717 


2629 


2628 


5021 


5020 


718 


2631 


2630 


5023 


5022 


719 


2633 


2632 


5025 


5024 


720 


2635 


2634 


5027 


5026 




2637 


2636 


5029 


5028 


722 


2639 


2638 


5031 


5030 


723 


2641 


2640 


5033 


5032 


724 


2643 


2642 


5035 


5034 


725 


2645 


2644 


5037 


5036 


726 


2647 


2646 


5039 


5038 


727 


2649 


2648 


5041 


5040 


728 


2651 


2650 


5043 


5042 


729 


2653 


2652 


5045 


5044 


730 


2655 


2654 


5047 


5046 


731 


2657 


2656 


5049 


5048 


732 


2659 


2658 


5051 


5050 


733 


2661 


2660 


5053 


5052 


734 


2663 


2662 


5055 


5054 


735 


2665 


2664 


5057 


5056 


736 


2667 


2666 


5059 


5058 


737 


2669 


2668 


5061 


5060 


738 


2671 


2670 


5063 


5062 


739 


2673 


2672 


5065 


5064 


740 


2675 


2674 


5067 


5066 


741 


2677 


2676 


5069 


5068 




2679 


2678 


5071 


5070 


743 


2681 


2680 


5073 


5072 


744 


2683 


2682 


5075 




745 


2685 


2684 


5077 


5076 


746 


2687 


2686 


5079 


5078 


747 


2689 2688 


5081 


5080 
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748 




2690 


5083 


5082 


749 




2692 


5085 


5084 




2695 


2694 


5087 


5086 




2697 


2696 


5089 


5088 






2698 


5091 


5090 


753 




2700 


5093 


5092 


754 




2702 


5095 


5094 


755 


2705 


2704 


5097 


5096 


756 




2706 


5099 


5098 




2709 


2708 


5101 


5100 




2711 


2710 


5103 


5102 




2713 


2712 


5105 


5104 


760 




2714 


5107 


5106 




2717 


2716 


5109 


5108 


762 


.,„ 2719 


2718 


5111 


5110 


763 




2720 


5113 


5112 


764 




2722 


5115 


5114 




2725 


2724 


5117 


5116 






2726 


5119 


5118 


767 




2728 


5121 


5120 






2730 


5123 


5122 


769 




2732 


5125 


5124 


770 




2734 


5127 


5126 


771 




2736 


5129 


5128 


772 




2738 


5131 


5130 




2741 


2740 


5133 


5132 


774 




2742 


5135 


5134 


775 




2744 


5137 


5136 




2747 


2746 


5139 


5138 


777 




2748 


5141 


5140 


778 


2751 


2750 






779 


2753 


2752 


5145 


5144 


780 


2755 


2754 


5147 


5146 


781 


2757 


2756 


5149 


5148 



WO 99/28475 



215 



PCT/IB98/01939 





ORFoiigosFd j ORFoligosFp 


QRFoHgosBd j ORFpUgosBp.: 




2759 


2758 


5151 


5150 




2761 


2760 


5153 


5152 


784 


2763 


2762 


5155 


5154 


785 


2765 


2764 


5157 


5156 




2767 


2766 


5159 


5158 




2769 


2768 


5161 


5160 




2771 


2770 


5163 


5162 




2773 


2772 


5165 


5164 




2775 


2774 


5167 


5166 




2777 


2776 


5169 


5168 




2779 


2778 


5171 


5170 


793 


2781 


2780 


5173 


5172 




2783 


2782 


5175 


5174 




2785 


2784 


5177 


5176 




2787 


2786 


5179 


5178 




2789 


2788 


5181 


5180 


798 


2791 


2790 


5183 


5182 


799 


2793 


2792 


5185 


5184 


800 


2795 


2794 


5187 


5186 


801 


2797 


2796 


5189 


5188 




2799 


2798 


5191 


5190 


803 


2801 


2800 


5193 


5192 




2803 


2802 


5195 


5194 




2805 


2804 


5197 


5196 




2807 


2806 


5199 


5198 




2809 


2808 


5201 


5200 




2811 


2810 


5203 


5202 


809 


2813 


2812 


5205 


5204 




2815 


2814 


5207 


5206 


81 1 


2817 




5209 


5208 


812 


2819 


2818 


5211 


5210 


813 


2821 


2820 


5213 


5212 


814 


2823 


2822 


5215 


5214 


815 


2825 


2824 


5217 


5216 



WO 99/28475 PCT/IB98/01939 



ORFGfeHset 




ORFoligosfp 


ORFoMgoaB4 


ORFoligosBp 


816 




2826 


5219 


5218 


817 




2828 


5221 


5220 


818 




2830 


5223 


5222 


819 




2832 


5225 


5224 


820 


2835 


2834 


5227 


5226 


821 




2836 


5229 


5228 


822 




2838 


5231 


5230 


823 




2840 


5233 


5232 


824 




2842 


5235 


5234 


825 




2844 


5237 


5236 


826 




2846 


5239 


5238 


827 




2848 


5241 


5240 


828 




2850 


5243 


5242 


829 




2852 


5245 


5244 


830 




2854 


5247 


5246 


831 




2856 


5249 


5248 


832 




2858 


5251 


5250 


833 




2860 


5253 


5252 


834 




2862 


5255 


5254 


835 




2864 


5257 


5256 


836 




2866 


5259 


5258 


837 






5261 


5260 


838 


2871 




5263 


5262 


839 




2872 


5265 


5264 


840 






5267 


5266 


841 




2876 


5269 


5268 


842 




2878 


5271 


5270 


843 




2880 


5273 


5272 


844 


2883 


2882 


5275 


5274 


845 






5277 


5276 


846 


2887 


2886 


5279 




847 


2889 


2888 


5281 


5280 


848 


2891 


2890 


5283 


5282 


849 


2893 


2892 


5285 


5284 
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2895 


2894 


5287 


5286 




2897 


2896 


5289 


5288 




2899 


2898 


5291 


5290 


853 


2901 


2900 


5293 


5292 




2903 


2902 


5295 


5294 




2905 


2904 


5297 


5296 


856 


2907 


2906 


5299 


5298 


857 


2909 


2908 


5301 


5300 




291 1 


2910 


5303 


5302 




2913 


2912 


5305 


5304 


860 


2915 


2914 


5307 


5306 


861 


2917 


2916 


5309 


5308 




2919 


2918 


5311 


5310 


863 


2921 


2920 


5313 


5312 




2923 


2922 


5315 


5314 




2925 


2924 


5317 


5316 


866 


2927 


2926 


5319 


5318 


867 


2929 


2928 


5321 


5320 


868 


2931 


2930 


5323 


5322 




2933 


2932 


5325 


5324 




2935 


2934 


5327 


5326 




2937 


2936 


5329 


5328 




2939 


2938 


5331 


5330 


873 


2941 


2940 


5333 


5332 


874 


2943 


2942 


5335 


5334 


875 


2945 


2944 


5337 


5336 




2947 


2946 


5339 


5338 


877 


2949 


2948 


5341 


5340 


878 


2951 


2950 


5343 


5342 




2953 


2952 


5345 


5344 


880 


2955 


2954 




5346 


881 


2957 


2956 


5349 


5348 


882 


2959 


2958 


5351 


5350 


883 


2961 


2960 


5353 


5352 
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884 


2963 


2962 




5354 


885 


2965 


2964 






886 


2967 


2966 




— 


887 


2969 


2968 




... 

5360 


888 


2971 


2970 




5362 


889 


2973 


2972 


5365 




890 


2975 


2974 




5366 


891 


2977 


2976 




5368 


892 


2979 


2978 




5370 


893 


2981 


2980 




5372 


894 


2983 


2982 




5374 


895 


2985 


2984 




5376 


896 


2987 


2986 




5378 


897 


2989 


2988 




5380 


898 


2991 


2990 




5382 


899 


2993 


2992 


538j 


5384 


900 


2995 


2994 




5386 


901 


2997 


2996 




5388 


902 


2999 


2998 




5390 


903 


3001 


3000 


5393 




904 


3003 


3002 




5394 


905 


3005 


3004 




5396 


906 


3007 


3006 




5398 


907 


3009 


3008 




5400 


908 


3011 


3010 


5403 




909 


3013 


3012 






910 


3015 


3014 




5406 


911 


3017 


3016 




5408 


912 


3019 






5410 


913 


3021 






5412 


914 


3023 


3022 


5415 


5414 


915 


3025 


3024 


5417 


5416 


916 


3027 


3026 


5419 


5418 


917 


3029 


3028 


5421 


5420 
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918 


3031 


3030 




542 l 


919 


3033 


3032 




5424 


920 


3035 


3034 




5426 


921 


3037 




5429 


5428 


922 


3039 






5430 


923 


3041 


3040 




5432 


924 


3043 


3042 




5434 


925 


3045 


3044 




5436 


926 


3047 




5439 


5438 


927 


3049 




5441 


5440 


928 


3051 




5443 


5442 


929 


3053 


3352 




5444 


930 


3055 


3054 


5447 


5446 


931 


3057 


3056 




5448 


932 


3059 


3058 




5450 


933 


3061 


3060 


5453 


5452 


934 


3063 




5455 


5454 


935 


3065 






5456 


936 


3067 


3066 




5458 


937 


3069 


3068 




5460 


938 


3071 


3070 




5462 


939 


3073 


3072 




5464 


940 


3075 


3074 




5466 


941 


3077 






5468 


942 


3079 


3078 




5470 


943 


3081 






5472 


944 


3083 


3082 




5474 


945 


3085 


3084^ 


5477 


5476 


946 


3087 




5479 


5478 


947 


3089 






5480 


948 


3091 


3090 


5483 


5482 


949 


3093 


3092 


5485 


5484 


950 


3095 


3094 


5487 


5486 


951 


3097 


3096 


5489 


5488 
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952 






5491 


5490 


953 


3101 




5493 


5492 


954 






5495 


5494 


955 






5497 


5496 


956 






5499 


5498 


957 






5501 


5500 


958 




3110 


5503 


5502 


959 




3112 


5505 


5504 


960 




3114 


5507 


5506 


961 




3116 


5509 


5508 


962 




3118 


551 1 


5510 


963 




3120 


5513 


5512 


964 


3123 


3122 


5515 


5514 


965 




3124 


5517 


5516 


966 




3126 


5519 


5518 


967 


3129 




5521 


5520 


968 






5523 


5522 


969 






5525 


5524 


970 


3135 




5527 


5526 


971 


3137 




5529 


5528 


972 




3138 


5531 


5530 


973 




3140 


5533 


5532 


974 






5535 


5534 


975 






5537 


5536 


976 




3146 


5539 


5538 




3149 


3148 


5541 


5540 


978 




3150 


5543 


5542 






3152 


5545 


5544 




3155 


3154 


5547 


5546 


981 




3156 


5549 


5548 


982 


3159 


3158 






983 


3161 


3160 


5553 


5552 


984 


3163 


3162 


5555 


5554 


985 


3165 


3164 


5557 


5556 
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986 




3166 


5559 


5558 


987 




3168 




5560 


988 




3170 




5562 


989 




3172 


5565 


5564 


990 








5566 


991 




3176 




5568 


992 


3179 


3178 




5570 


993 




3180 


5573 


5572 


994 




3182 


5575 


5574 


995 


3185 


3184 


5577 


5576 


996 




3186 


5579 


5578 


997 




3188 


5581 


5580 


998 




3190 


5583 


5582 


999 


3193 


3192 


5585 


5584 


1000 




3194 


5587 


5586 


1001 




3196 


5589 


5588 


1002 




3198 


5591 


5590 


1003 


3201 


3200 


5593 


5592 


1004 






5595 


5594 


1005 








5596 


1006 




3206 


5599 


5598 


1007 








5600 


1008 




3210 




5602 


1009 




3212 


5605 


5604 


1010 




3214 




5606 


1011 




3216 




5608 


1012 




3218 


5611 


5610 


1013 


3221 


3220 


5613 


5612 


1014 




3222 


5615 


5614 


1015 


3225 


3224 


5617 


5616 


1016 


3227 


3226 


5619 


5618 


1017 


3229 


3228 


5621 


5620 


1018 


3231 


3230 


5623 


5622 


1019 


3233 


3232 


5625 


5624 
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ORFoligosFd 


ORFoligosFp 


ORFoMgosBcL 


ORFoJigosBp 


1020 


3235 


3234 




5626 


1021 


3237 




5629 


5628 


1022 


3239 




5631 


5630 


1023 


3241 




5633 


5632 


1024 


3243 


3242 


5635 


5634 


1025 


3245 




5637 


5636 


1026 


3247 




5639 


5638 


1027 


3249 




5641 


5640 


1028 


3251 




5643 


5642 


1029 


3253 




5645 


5644 


1030 


3255 




5647 


5646 


1031 


3257 




5649 


5648 


1032 


3259 




5651 


5650 


1033 


3261 


3260 


5653 


5652 


1034 


3263 


3262 


5655 


5654 


1035 


3265 




5657 


5656 


1036 


3267 




3659 


5658 


1037 


3269 




5661 


5660 


1038 


3271 




5663 


5662 


1039 


3273 


3272 




5664 


1040 


3275 


3274 




5666 


1041 


3277 


3276 


5669 


5668 


1042 


3279 




5671 


5670 


1043 


3281 


3280 


5673 


5672 


1044 


3283 




5675 


5674 


1045 


3285 






5676 


1046 


3287 




5679 


5678 


1047 


3289 




5681 


5680 


1048 


3291 




5683 


5682 


1049 


3293 






5684 


1050 


3295 


3294 


5687 


5686 


1051 


3297 


3296 


5689 


5688 


1052 


3299 


3298 


5691 


5690 


1053 


3301 


3300 


5693 


5692 
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PCT/IB98/01939 



ORFGecset 




kjx\ r on gusr p 


UKrOiJgOSBa 


ORFoligosBp; 


1054 


3303 






5694 


1055 


3305 


3304 




5696 


1056 


3307 


3306 




5698 


1057 


3309 


3308 




5700 


1058 


3311 


3310 




5702 


1059 


3313 


3312 




5704 


1060 


3315 


3314 




5706 


1061 


3317 




5709 


5708 


1062 


3319 




5711 


5710 


1063 


3321 


3320 


5713 


5712 


1064 


3323 






5714 


1065 


3325 


3324 




5716 


1066 


3327 


3326 




5718 


1067 


3329 


3328 




5720 


1068 


3331 


3330 




5722 


1069 


3333 


3332 




5724 


1070 


3335 


3334 




5726 


1071 


3337 


3336 




5728 


1072 


3339 


3338 




5730 


1073 


3341 


3340 




5732 


1074 


3343 


3342 




5734 


1075 


3345 


3344 




5736 


1076 


3347 


3346 




5738 


1077 


3349 


3348 




5740 


1078 


3351 


3350 




5742 


1079 


3353 






5744 


1080 


3355 






5746 


1081 


3357 


3356 




5748 


1082 


3359 




5751 


5750 


1083 


3361 


3360 




5752 


1084 


3363 


3362 


5755 


5754 


1085 


3365 


3364 


5757 


5756 


1086 


3367 


3366 


5759 


5758 


1087 


3369 


3368 


5761 


5760 
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QKFGsisset 




UKi'OilgOSrp 


ORFoligosBd J ORFoligosBjj 


1088 


3371 




5763 


5762 


1089 


3373 




5765 


5764 


1090 


3375 




5767 


5766 


1091 


3377 




5769 


5768 


1092 


3379 




5771 


5770 


1093 


3381 




5773 


5772 


1094 


3383 




5775 


5774 


1095 


3385 


3384 


5777 


5776 


1096 


3387 




5779 


5778 


1097 


3389 




5781 


5780 


1098 


3391 




5783 


5782 


1099 


3393 




5785 


5784 


1100 


3395 




5787 


5786 


1101 


3397 




5789 


5788 


1102 


3399 




5791 


5790 


1103 


3401 




5793 


5792 


1104 


3403 




5795 


5794 


1105 


3405 




5797 


5796 


1106 


3407 




5799 


5798 


1107 


3409 




5801 


5800 


1108 


341 1 




5803 


5802 


1109 


3413 




5805 


5804 


1110 


3415 




5807 


5806 


1111 


3417 




5809 


5808 


1112 


3419 




5811 


5810 


1113 


3421 




5813 


5812 


1114 


3423 




5815 


5814 


1115 


3425 




5817 


5816 


1116 






5819 


5818 


1117 










1118 


3431 


3430 


5823 


5822 


1119 


3433 


3432 


5825 


5824 


1120 


3435 


3434 


5827 


5826 


1121 


3437 | 3436 


5829 


5828 
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ORFGenset 




UKroiigoarp 


ORJFoligosBd 


ORFoligosBp 


1122 






5831 


5830 


1123 


3441 




5833 


5832 


1124 


3443 




5835 


5834 


1125 


3445 




5837 


5836 


1126 


3447 




5839 


5838 


1127 


3449 




5841 


5840 


1128 


3451 




5843 


5842 


1129 


3453 




5845 


5844 


1130 


3455 


3454 


5847 


5846 


1131 


3457 




5849 


5848 


1132 


3459 




5851 


5850 


1133 


3461 




5853 


5852 


1134 


3463 




5855 


5854 


1135 


3465 




5857 


5856 


1136 


3467 


3466 


5859 


5858 


1137 


3469 




5861 


5860 


1138 


3471 


3470 


5863 


5862 


1139 


3473 




5865 


5864 


1140 


3475 




5867 


5866 


1141 


3477 




5869 


5868 


1142 


3479 


3478 


5871 


5870 


1143 


3481 




5873 


5872 


1144 


3483 


3482 




5874 


1145 


3485 




5877 


5876 


1146 


3487 




5879 


5878 


1147 


3489 




5881 


5880 


1148 


3491 




5883 


5882 


1149 


3493 




5885 


5884 


1150 


3495 




5887 


5886 


1151 


3497 




5889 


5888 


1152 


3499 


3498 


5891 


5890 


1153 


3501 


3500 


5893 


5892 


1154 


3503 


3502 


5895 


5894 


1155 


3505 


3504 


5897 


5896 
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ORFGensei 




'•JivT Oil gOSx' p 


ORFoligosBd 


ORFpligosBp: 


1156 


3507 




5899 


5898 


1157 


3509 


3508 


5901 


5900 


1158 


3511 






5902 


1159 


3513 




5905 


5904 


1160 


3515 


3514 


5907 


5906 


1161 


3517 




5909 


5908 


1162 


3519 




5911 


5910 


1163 


3521 




5913 


5912 


1164 


3523 




5915 


5914 


1165 


3525 


3^6 


5917 


5916 


1166 


3527 




5919 


5918 


1167 


3529 




5921 


5920 


1168 


3531 




5923 


5922 


1169 


3533 




5925 


5924 


1170 


3535 




5927 


5926 


1171 


3537 




5929 


5928 


1172 


3539 


3538 


5931 


5930 


1173 


3541 




5933 


5932 


1174 


3543 


3542 


5935 


5934 


1175 


3545 




5937 


5936 


1176 


3547 




5939 


5938 


1177 


3549 




5941 


5940 


1178 


3551 




5943 


5942 


1179 


3553 


3552 


5945 


5944 


1180 


3555 




5947 


5946 


1181 


3557 




5949 


5948 


1182 


3559 




5951 


5950 


1183 


3561 




5953 


5952 


1184 


3563 




5955 


5954 


1185 


3565 




5957 


5956 


1186 


3567 


3566 


5959 


5958 


1187 


3569 


3568 


5961 


5960 


1188 


3571 


3570 


5963 


5962 


1189 


3573 


3572 


5965 


5964 
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ORBGenset 


ORFoligosFd 






ORFoligosBp 


1190 


3575 






5966 


1191 


3577 


3576 




5968 


1192 


3579 






5970 


1193 


3581 






5972 


1194 


3583 


3582 


5975 


5974 


1195 


3585 


3584 


5977 


5976 


1196 


3587 


3586 


5979 


5978 


1197 


3589 


3588 


5981 


5980 
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TABLE 5 



SEQID 




position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 














785793 




4388 


B 


394245 














789918 




4389 


B 


396116 


1200 












788039 




4390 


B 


395604 


1201 


p 










790378 




4391 


B 


397475 














788456 




4392 


B 


396249 


1203 












791834 




4393 


B 


398133 


1204 












789918 




4394 


B 


396759 


1205 












793102 




4395 


B 


398660 


1206 












791176 




4396 


B 


397746 


1207 


p 








,93826 


4397 


B 


399639 


1208 










791921 


4398 


B 


398973 


1209 


p 


5961 






794911 


4399 


B 


400878 


1210 


p 








793023 


4400 


B 


399921 


121 1 










795296 


4401 


B 


401846 


1212 










793427 


4402 


B 


400393 


1213 


p 


6834 






796003 


4403 


B 


402287 


1214 


p 








794127 


4404 


B 


401444 


1215 


p 


7709 






796729 


4405 


B 


403344 


1216 


p 








794811 


4406 


B 


402258 


1217 


p 








797041 


4407 


B 


404150 


1218 










795065 


4408 


B 


403461 


1219 










797553 


4409 


B 


405340 


1220 


p 








795651 


4410 


B 


405400 


1221 










797716 


4411 


B 


407325 


1222 










795815 


4412 


B 


404027 


1223 










798197 


4413 


B 


405941 


1224 


F 


14529 


2819 


F 


796285 








1225 


F 


12643 


2820 


F 


799004 


4415 


B 


408055 


1226 


F 


14668 


2821 


F 


797173 




4416 


B 


407325 


1227 


F 


12813 




2822 


F 


799785 




4417 


B 


409172 


1228 


F 


15747 




2823 


F 


797910 




4418 


B 


409999 


1229 


F 


13844 




2824 


F 


800789 




4419 


B 


411893 
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SEQ ID 




position 


1230 


p 




1231 


p 


14019 


1232 


p 




1233 


p 


15298 


1234 


p 




1235 


p 




1236 


p 




1237 






1238 


p 




1239 


p 




1240 


p 


22259 


1241 






1242 






1243 


p 




1244 


p 


22890 


1245 


p 


20987 


1246 


p 


23 150 


1247 


p 


21244 


1248 


F 




1249 


F 




1250 


p 


26379 


1251 


F 




1252 


p 


27498 


1253 


p 




1254 


p 


28476 


1255 


p 




1256 


p 




1257 






1258 


p 




1259 






1260 


F 


31184 


1261 


F 


29287 


1262 


F 


31574 


1263 


F 


29650 



229 



SEQ ID 




position 






798866 






80180 0 












802561 












o UZ o O _L 






8 00926 






8 04088 












8 05071 
























8 07742 


2839 




805907 






OwOOOU 






8 06 95 9 






810 0 74 






q n c o n o 


















812 08 8 


















































2855 


F 




2856 


F 


815188 


2857 




813268 


2858 


F 


815897 
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SEQ ID 


Or . 


position 


4420 


B 


411645 


4421 


B 


413542 


4422 


B 


413693 


4423 


B 


415530 


4424 


B 


413693 


4425 


B 


415559 


4426 


B 


414172 


4427 


B 


416072 


4428 


B 


415337 


4429 


B 


417275 


4430 


B 


414599 


4431 


B 


416499 


4432 


B 


416887 


4433 


B 


418821 


4434 


B 


417700 






419585 




B 


418274 






420173 


4438 


B 


418823 


4439 


B 


420732 


4440 


B 


419778 


4441 




421678 




B 


420461 






422361 


4444 


B 


421460 


4445 


B 


423336 


4446 


B 


422265 


4447 


B 


424120 


4448 


B 


423263 






425182 








4451 


B 


427252 


4452 


B 


426283 


4453 


B 


428210 
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SEQID 


Or. 


position 








position 


1264 


F 


33095 




2859 






1265 


F 


31184 










1266 


F 


33840 










1267 


F 


31949 




2862 




819089 


1268 


F 


34769 




2863 




8 172 0 1 


1269 


F 


32869 




2864 




819482 


1270 


F 


34915 




2865 






1271 


F 


32961 










1272 


F 


35696 




2867 




8182 52 


1273 


F 


33793 




2868 




820800 


1274 


F 


36794 




2869 




818900 


1275 


F 


34893 




2870 






1276 


F 


37960 




2871 




8195 00 


1277 


F 


36085 




2872 






1278 


F 


38924 




2873 




82 0 003 


1279 


F 


37017 




2874 




8 22811 


1280 


F 


39704 




2875 


p 


820926 


1281 


F 


37754 




2876 


F 




1282 


F 


40541 




2877 






1283 


F 


38615 




2878 






1284 


F 


41945 




2879 




823 74 7 


1285 


F 


40054 










1286 


F 


42779 










1287 


F 


40859 




2882 






1288 


F 


43991 




2883 




324894 


1289 


F 


42061 




2884 






1290 


F 


45056 










1291 


F 


43155 




2886 






1292 


F 


45755 










1293 


F 


43821 










1294 


F 


46272 




2889 


F 


827236 


1295 


F 


44382 




2890 


F 


830870 


1296 


F 


46654 




2891 


F 


828917 


1297 


F 


44763 




2892 


F 


831522 
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SEQ ID 


Or . 


position 


4454 


B 


427252 


4455 


B 


429129 


4456 


B 


428040 


4457 




429940 


4458 


B 


430106 


4459 


B 


432063 


4460 


B 


430580 


4461 


B 


432480 


4462 


B 


430860 


4463 




432776 


4464 


B 


432063 


4465 


B 


433919 


4466 


B 


432263 


4467 


B 


434137 


4468 


B 


434730 


4469 


B 


436671 






436671 






438495 






436803 




B 


438696 




B 


437953 




B 


439850 




B 


438490 


4477 


B 


440383 


4478 


B 


43 9374 


4479 


B 


441289 


4480 


B 


439562 


4481 


B 


441466 


4482 


B 


439976 




B 


441847 








4485 


B 


443216 


4486 


B 


442161 


4487 


B 


444066 
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SEQID 


Or. 


position 




SEQ ID 


Or. 


position 








pos x t ion 


1298 


F 


47926 




2893 


F 


829613 










1299 


F 


46059 




2894 


F 


831995 










1300 


F 


48403 




2895 


F 


83 0093 








446608 


1301 


F 


46485 




2896 


F 


832585 








448508 


1302 


F 


49871 




2897 


F 


830686 








A A B O Q S3 


1303 


F 


47980 




2898 


F 


833149 










1304 


F 


50706 




2899 


F 


831240 








44 9798 


1305 


F 


48792 




2900 


F 


833660 










1306 


F 


52129 


2901 


F 


831704 










1307 


F 


50199 




2902 


F 


834442 










1308 


F 


53247 


2903 


F 


832539 






451891 


1309 


F 


51346 


2904 


F 


835147 








1310 


F 


54376 


2905 


F 


833252 








1311 


F 


52462 


2906 


F 


835536 








1312 


F 


54790 


2907 


F 


833656 








1313 


F 


52890 


2908 


F 


836378 








1314 


F 


55404 


2909 


F 


834480 






45508 8 


1315 


F 


53540 


2910 


F 


836990 








1316 


F 


56602 


2911 


F 


835067 








1317 


F 


54695 


2912 


F 


838512 








1318 


F 


58151 


2913 


F 


836603 


4508 






1319 


F 


56284 


2914 


F 


839718 


4509 




458221 


1320 


F 


58965 


2915 


F 


837811 








1321 


F 


57039 


2916 


F 


840211 








1322 


F 


59955 


2917 


F 


838266 








1323 


F 


58032 


2918 


F 


841434 






4 60570 


1324 


F 


61247 


2919 


F 


839485 








1325 


F 


59364 




2920 


F 


842250 










1326 


F 


62249 




2921 


F 


840377 










1327 


F 


60375 




2922 


F 


842761 










1328 


F 


63117 




2923 


F 


840912 




4518 


B 


461887 


1329 


F 


61247 




2924 


F 


343000 




4519 


B 


463795 


1330 


F 


53829 




2925 


F 


341103 




4520 


B 


462842 


1331 


F 


51908 




2926 


F 


343583 




4521 


B 


464780 
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SEQID 


Or. 


position 




SEQ ID 


Or . 


position 




SEQ ID 




position 


1332 


F 


64066 




2927 


F 












1333 


F 


62136 




2928 


F 


845985 










1334 


F 


64369 


2929 


F 


844098 










1335 


F 


62437 




2930 


F 


847919 








466801 


1336 


F 


65124 




2931 


F 


846025 








4 6 6 078 


1337 


F 


63225 




2932 


F 












1338 


F 


67407 




2933 


F 


848109 










1339 


F 


65513 




2934 


F 










469540 


1340 


F 


68652 


2935 


F 


849547 








^fc u 2 4 U O 


1341 


F 


66758 


2936 


F 










1342 


F 


68946 


2937 


F 


851567 








1343 


F 


67080 


2938 


F 


854701 






4714 00 


1344 


F 


69660 


2939 


F 


852801 






469895 


1345 


F 


67818 


2940 


F 








471798 


1346 


F 


70432 


2941 


F 


853282 








1347 


F 


68572 


2942 


F 


856012 








1348 


F 


70866 


2943 


F 


854111 








1349 


F 


68946 


2944 


F 


857227 








1350 


F 


73272 


2945 


F 


855326 








1351 


F 


71373 


2946 


F 


859309 








1352 


F 


74657 


2947 


F 


857458 








1353 


F 


72752 


2948 


F 


859418 








1354 


F 


75282 


2949 


F 


857515 








1355 


F 


73383 


2950 


F 


860468 






4 7 66GS 


1356 


F 


76781 


2951 


F 










1357 


F 


74878 




2952 


F 


861361 








1358 


F 


76925 




2953 


F 


859441 










1359 


F 


75017 




2954 


F 










478682 


1360 


F 


77935 




2955 


F 


859979 








478861 


1361 


F 


76028 




2956 


F 


863352 








4 8 0821 


1362 


F 


79611 




2957 


F 


861444 




4552 


B 


479311 


1363 


F 


77750 




2958 


F 


363777 




4553 


B 


481243 


1364 


F 


82371 




2959 


F 


361872 




4554 


B 


479943 


1365 


F 


80509 




2960 


F 


364636 




4555 


B 


481858 



WO 99/28475 



SEQ ID 


Or. 




1366 


F 


83502 


1367 


p 


81655 


1368 


F 


84657 


1369 


F 


82740 


1370 


F 


87093 


1371 


F 


85186 


1372 


F 


87188 


1373 


p 


85320 


1374 


F 


88179 


1375 


F 


86281 


1376 


F 


88486 


1377 


F 


86598 


1378 


F 


89077 


1379 


F 


87236 


1380 


F 


89495 


1381 


F 


87578 


1382 


p 


91202 


1383 


p 


89232 


1384 


F 


91526 


1385 


F 


89598 


1386 


F 


92085 


1387 


F 


90203 


1388 


p 


93104 


1389 


F 


91239 


1390 


F 


93833 


1391 


F 


91938 


1392 


F 


94392 


1393 


F 


92508 


1394 


F 


97894 


1395 


F 




1396 


F 


98502 


1397 


F 


96620 


1398 


F 


100117 


1399 


F 


98215 



233 







pos i tion 






862 792 






ODuUO'l 


2953 




864184 












O D^± D (J (J 








2967 
















8 6 696 0 


2970 




ft £ Qfi c: n 
























8 722 10 


2975 






297G 




8 72497 


2977 




8 7 0 5 97 


2978 






2979 






2980 




873 800 






871909 


2982 












2984 






2985 






2986 






2987 




3 74 84 g 


2988 


















2991 




876044 


2992 




878633 


2993 


F 


376695 


2994 


F 


378886 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


4556 


B 


480257 


4557 


B 


482146 


4558 


B 


481708 


4559 


B 


483633 


4560 


B 


481969 


4561 


B 


483871 


4562 


B 


483668 


4563 


B 


485559 


4564 


B 


485198 


4565 


B 


487094 


4566 


B 


488084 


4567 


B 


409985 


4568 


B 


485945 


4569 


B 


487859 


4570 


B 


489498 


4571 


B 


491367 


4572 


B 


488799 


4573 


B 


490691 


4574 


B 


490677 


4575 




492589 


4576 




492994 


4577 


B 


494929 


4578 


B 


493113 


4579 


B 


495035 


4580 


B 


493985 


4581 


B 


495864 


4582 


B 


494929 


4583 


B 


496801 


4584 


B 


495090 


4585 


B 


496989 








4587 


B 


497485 


4588 


B 


495436 


4589 


B 


497304 



WO 99/28475 

234 



SEQ ID 




position 




SEQ ID 




position 


1400 


p 


101104 










1401 


p 










879824 


1402 


p 












1403 


p 


i onnso 








88 0670 


1404 


p 


102499 










1405 












881719 


1406 




104014 










1407 


p 










8 82682 


1408 


p 


105028 










1409 


p 


103092 










1410 


p 


107210 










141 1 


p 












1412 


p 


108446 










1413 


p 


106545 










1414 


p 


108792 










1415 


p 


106853 










1416 


p 


109472 








8 8 6 762 


1417 


p 










o y u u o 4 


1418 


p 


1 1 1060 








OQQ1 on 

o o oloZ 


1419 


p 












1420 


p 


1 12669 








o o o y y o 


1421 


F 


1 10796 








8 9 174 9 


1422 


F 


113335 








88983 0 


1423 


p 












1424 


p 


1 13733 










1425 


p 


111882 










1426 


p 


114479 










1427 


F 












1428 


F 


1 15138 










1429 


p 










3 94 0 05 


1430 


F 


115765 




3025 


F 


892127 


1431 


F 


113891 




3026 




894827 


1432 


F 


119580 




3027 


F 


892900 


1433 


F 


117660 




3028 


F 


895732 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


4590 


B 


496854 


4591 


B 


498754 


4592 


B 


497396 


4593 


B 


499316 


4594 


B 


498735 


4595 


B 


500635 


4596 


B 


499484 


4597 


B 


501409 


4598 


B 


501005 


4599 


B 


502852 


4600 


B 


501937 


4601 


B 


503853 


4602 


B 


503083 


4603 


B 


505003 


4604 


B 


503895 


4605 


B 


505846 


4606 


B 


505263 


4607 


B 


507137 


4608 


B 


507214 


4609 


B 


509106 


4610 




507687 






509559 




B 


508632 


4613 


B 


510534 


4614 


B 


508863 


4615 


B 


510730 


4616 


B 


509202 


4617 


B 


511062 


4618 


B 


510940 


4619 


B 


512832 








4621 


B 


513649 


4622 


B 


512446 


4623 


B 


514305 



WO 99/28475 
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PCT/1B98/01939 



SEQID 


Or. 


position 




SEQ ID 








SEQ ID 




position 


1434 


F 


123834 




3029 




8 93 7 90 










1435 


F 


121914 




3030 




896823 








515212 


1436 


F 


124649 




3031 




8 94907 








514529 


1437 


F 


122753 




3032 


p 


900571 










1438 


F 


125280 


3033 


F 


898639 










1439 


F 


123416 




3034 


F 


902407 










1440 


F 


126101 




3035 




900507 








515496 


1441 


F 


124208 




3036 












517389 


1442 


F 


126871 


3037 














1443 


F 


125013 


3038 


F 










1444 


F 


127698 


3039 










516642 


1445 


F 


125787 


3040 


F 


90548 6 






518551 


1446 


F 


129465 


3041 


F 


903589 






517420 


1447 


F 


127467 


3042 


F 


906234 






519349 


1448 


F 


130799 


3 043 




904350 






518187 


1449 


F 


128869 


3 044 










520053 


1450 


F 


131615 


3045 


F 


904846 






518617 


1451 


F 


129711 


304S 


F 


907868 






520500 


1452 


F 


132856 


3047 


F 


905924 








1453 


F 


130914 


3048 


p 


908501 








1454 


F 


133401 


3049 


F 










1455 


F 


131474 


3050 




908975 








1456 


F 


133624 


3051 


F 


907079 








1457 


F 


131706 


3052 


F 


909351 








1458 


F 


134385 


3053 


F 










1459 


F 


132500 


3054 


F 


909835 








1460 


F 


137183 


3055 










522354 


1461 


F 


135320 


3056 




9103 82 






524287 


1462 


F 


140106 




3057 




908496 








523763 


1463 


F 


138215 




3058 












525689 


1464 


F 


140839 




3059 


F 


908829 




4654 


B 


524854 


1465 


F 


138927 




3060 


F 


912169 




4655 


B 


526756 


1466 


F 


141535 




3061 


F 


910248 




4656 


B 


525970 


1467 


F 


139614 




3062 


F 


912376 




4657 


B 


527866 



WO 99/28475 



236 



PCT/IB98/01939 



SEQ ID 


Or. 










positxon 




SEQ ID 


Or. 


position 


1468 


F 


142909 




3063 












526312 


1469 


F 


140952 




3064 


F 


912 984 








528202 


1470 


F 


143684 








911084 








526640 


1471 


F 


141782 
















528553 


1472 


F 


144309 




3067 












526991 


1473 


F 


142375 




3068 




914282 








528855 


1474 


F 


146178 




3069 


F 










528553 


1475 


F 


144294 




3070 








4665 




530443 


1476 


F 


146894 




3071 










529081 


1477 


F 


144997 


3 072 








B 


53 0988 


1478 


F 


147858 








4668 


B 


529943 


1479 


F 


145960 


3 074 




91582 7 


4669 


B 


531844 


1480 


F 


148277 


3 075 






4670 


B 


530424 


1481 


F 


146347 


3076 


p 






B 


532301 


1482 


F 


148781 


3 077 




914788 


4672 


B 


530799 


1483 


F 


146846 


3 078 








B 


532675 


1484 


F 


148947 


3 079 








B 


531670 


1485 


F 


147021 


3080 




918089 






533594 


1486 


F 


149424 


3081 










533498 


1487 


F 


147592 


3082 


F 


9183 99 






535393 


1488 


F 


150769 


3083 


F 








534147 


1489 


F 


148884 






919296 




B 


535997 


1490 


F 


151743 


3085 








B 


534892 


1491 


F 


149880 


3086 








B 


536813 


1492 


F 


152659 








4682 


B 


536191 


1493 


F 


150769 






919864 


4683 




538068 


1494 


F 


153101 


3089 








B 


539438 


1495 


F 


151270 




3090 








B 


541306 


1496 


F 


[53719 












4686 


B 


540771 


1497 


F 


[51850 














B 


542639 


1498 


F 


155002 




3093 


F 


919138 




4688 


B 


541223 


1499 


F 


153096 




3094 


F 


921239 




4689 


B 


543141 


1500 


F 


156550 




3095 


F 


919366 




4690 


B 


542025 


1501 


F 


154687 




3096 


F 


921526 




4691 


B 


543927 



WO 99/28475 
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PCT/IB98/01939 



SEQ ID 


Or. 


position 




SEQ ID 




position 




SEQ ID 


Or. 


position 


1502 


p 














4692 


B 


543495 


1503 


p 


155353 












4693 


B 


545375 


1504 


p 














4694 


B 


544367 


1505 


p 










9222 12 




4695 


B 


546253 


1506 


p 


159676 












4696 


B 


544790 


1507 


p 


157795 












4697 


B 


546697 


1508 


p 


160957 












4698 


B 


544982 


1509 


p 


159063 








923258 




4699 


B 


546890 


1510 


p 














4700 


B 


546655 


1511 


p 


159504 












4701 


B 


548555 


1512 


p 










921929 




4702 


B 


547701 


1513 


p 


1 60240 








9241£5 




4703 


B 


549667 


1514 


p 














4704 


B 


547609 


1515 


p 


1 








92468 0 




4705 


B 


549533 


1516 


F 


164236 












4706 


B 


548121 


1517 


p 


162345 












4707 


3 


550040 


1518 


p 


165837 












4708 


B 


548878 


1519 


p 


163923 












4709 


B 


550836 


1520 


p 


166508 












4710 


B 


549681 


1521 


p 










926972 




4711 


B 


551602 


1522 
















4712 


B 


550605 


1523 


p 


166683 












4713 


B 


552527 


1524 


p 


169367 












4714 


B 


551849 


1525 


p 














4715 


B 


553750 


1526 


p 


170556 












4716 


B 


553261 


1527 


p 


168661 








9289 74 




4717 


B 


555165 


1528 


p 










927031 




4718 


B 


555176 


1529 


p 










93 0003 




4719 


B 


557075 


1530 












928103 




4720 


B 


556590 


1531 


p 














4721 


B 


558489 


1532 


F 


172797 




3127 


F 


928402 




4722 


B 




1533 


F 


170904 




3128 


F 


931084 




4723 


B 


559024 


1534 


F 


174176 




3129 


F 


929222 




4724 


B 


558346 


1535 


F 


172236 




3130 


F 


931307 




4725 


B 


560246 



WO 99/28475 



SEQ ID 




position 


1536 






1537 




173 138 


1538 


p 




1539 


p 


173595 


1540 


p 


177183 


1541 


p 


175275 


1542 


p 




1543 






1544 


p 


179819 


1545 


p 


177923 


1546 


p 




1547 


p 


178455 


1548 


p 




1549 


p 


179226 


1550 


p 


182628 


1551 


p 


180694 


1552 


p 


183403 


1553 


p 


181494 


1554 


p 


T^S 


1555 


p 




1556 


p 


T85763 


1557 






1558 


p 


1 86496 


1559 


p 


184614 


1560 


p 




1561 


p 


1 85262 


1562 


p 




1563 


p 




1564 


p 




1565 






1566 


F 


190016 


1567 


F 


188083 


1568 


F 


190545 


1569 


F 


188666 



238 



SEQ ID 




pos i tion 












931824 
























933 044 












933303 








































































94 0791 






938889 






94 1806 












944314 


















943 090 




















3161 




944986 


3162 


F 


948258 


3163 


F 


946403 


3164 


F 


949037 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


4726 


B 


558455 


4727 


B 


560376 


4728 


B 


559450 


4729 


B 


561362 


4730 


B 


561581 


4731 


B 


563478 


4732 


B 


563153 


4733 


B 


565073 


4734 


B 


564319 


4735 


B 


566220 


4736 


B 


564201 


4737 


B 


566078 


4738 


B 


567243 


4739 


B 


569143 


4740 


B 


568192 


4741 


B 


570133 


4742 


B 


570619 


4743 


B 


572532 


4744 


B 


572241 


4745 


B 


574208 


4746 


B 


572994 


4747 


B 


574916 


4748 


B 


573744 


4749 


B 


575640 


4750 


B 


573679 


4751 


B 


575571 


4752 


B 


574398 


4753 


B 


576288 


4754 


B 


574677 


4755 


B 


576633 




B 


575033 


4757 


B 


576922 


4758 


B 


575482 


4759 


B 


577363 



WO 99/28475 
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PCT/IB98/01939 



SEQID 


Or. 










position 




SEQ ID 




position 


1570 


F 


191538 




3165 


F 












1571 


F 


189595 




3166 




949581 








577606 


1572 


F 


192173 
















576078 


1573 


F 


190247 








950455 








577993 


1574 


F 


193015 








948558 








578265 


1575 


F 


191135 
















580143 


1576 


F 


194471 




3171 










578948 


1577 


F 


192522 




3172 










580848 


1578 


F 


194946 


3173 










582336 


1579 


F 


193015 












584225 


1580 


F 


196798 








4770 


B 


582917 


1581 


F 


194896 


3176 






4771 


B 


584 317 


1582 


F 


197440 






952288 






583359 


1583 


F 


195550 


3178 


F 








585252 


1584 


F 


197440 


3179 










583734 


1585 


F 


195549 


3180 


F 








585634 


1586 


F 


198736 


3181 




95 570 3 






584122 


1587 


F 


196802 


3182 




959053 






585990 


1588 


F 


199722 












584665 


1589 


F 


197822 


3184 




960824 








1590 


F 


200003 


3185 


F 


958932 








1591 


F 


198147 


3 186 










587123 


1592 


F 


200361 


3187 


p 


959368 






585517 


1593 


F 


198453 












587434 


1594 


F 


200945 










B 


586850 


1595 


F 


199009 


3190 










588730 


1596 


F 


202122 


3191 










588017 


1597 


F 


200215 


3192 




954706 






589878 


1598 


F 


203251 






962 822 




B 


589628 


1599 


F 


201352 


3194 










591543 


1600 


F 


203807 




3195 


F 


964074 


4790 


B 


589798 


1601 


F 


201903 




3196 


F 


967210 


4791 


B 


591723 


1602 


F 


206555 




3197 


F 


965307 


4792 


B 


590323 


1603 


F 


204669 




3198 


F 


967808 


4793 


B 


592211 



WO 99/28475 



SEQID 




position 












205369 


1606 




208293 


1607 






1608 




209252 


1609 


p 




1610 




210330 


1611 






1612 






1613 


p 


208694 


1614 






1615 


p 




1616 




212650 


1617 






1618 




213920 


1619 


p 




1620 


p 




1621 


p 




1622 




215003 


1623 






1624 






1625 




214772 


1626 






1627 


p 




1628 






1629 






1630 






1631 






1632 




218960 


1633 






1634 


F 


219646 


1635 


F 


217772 


1636 


F 


220257 


1637 


F 


218379 



240 



SEQ ID 


Or. 


position 


3199 


F 


965925 


3200 




969286 


3201 




967320 






970802 






968888 






972169 


3205 


F 


970269 


3206 




973487 






971616 






974339 






9724 08 






974988 




F 


973035 


3212 


F 


976035 






974114 


















976665 






974730 






977439 


3219 


F 


975500 




F 


977698 




F 


975799 






978389 






976478 




F 


978665 


3225 




976760 


3226 


F 


979134 




F 


977270 






979473 








3230 


F 


980915 


3231 


F 


979026 


3232 


F 


982043 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


4794 


B 


591492 


4795 


B 


593419 


4796 


B 


593002 


4797 


B 


594885 


4798 


B 


593367 


4799 


B 


595321 


4800 


B 


594166 


4801 


B 


596020 


4802 


B 


595942 


4803 


B 


597826 


4804 


B 


596354 


4805 


B 


598255 


4806 


B 


597147 


4807 


B 


598998 


4808 


B 


597960 


4809 


B 


599851 


4810 


B 


601068 


4811 


B 


602929 


4812 


B 


602096 


4813 


B 


603996 


4814 


B 


603761 


4815 


B 


605643 


4816 


B 


604014 


4817 


B 


605920 


4818 


B 


604634 


4819 


B 


606548 


4820 


B 


605864 


4821 


B 


607736 


4822 


B 


606903 


4823 


B 


608742 


4824 


B 


607722 


4825 


B 


609674 


4826 


B 


609329 


4827 | B 


611215 
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PCT/IB98/01939 



SEQ ID 


Or. 










position 




SEQ ID 


Or. 


position 


1638 


F 


220903 








98 017 8 








609916 


1639 


p 


218989 








983628 






B 


611866 


1640 


p 


221314 
















612090 


1641 


p 


219470 








984 14 9 








613998 


1642 


p 


222253 




3237 




982271 








613166 


1643 


F 


220338 








98518 0 








615091 


1644 


F 


223 1 86 








Q P 7 O Q 0 

y o j z ts u 




4834 


B 


613838 


1645 


F 


221278 












4835 


B 


615761 


1646 


p 


223994 








4836 


B 


614217 


1647 


F 


222146 








4837 


B 


616094 


1648 


F 


224908 








4838 


B 


615464 


1649 


F 


223014 


3244 






4839 




617391 


1650 


F 


225051 








4840 


B 


615913 


1651 


F 


223131 






98768 6 


4841 


B 


617803 


1652 


F 


225510 








4842 


B 


617932 


1653 


p 


223615 






9 8 8 559 




B 


619837 


1654 


F 


226550 






98 6 67 0 




B 


618598 


1655 


F 


224609 








4845 


B 


620532 


1656 


F 


226928 








4846 


B 


619591 


1657 


F 


225029 






993 122 




B 


621465 


1658 


F 


227528 






991289 




B 


620636 


1659 


F 


225631 










B 


622536 


1660 


p 


228388 










B 


620532 


1661 


F 


226475 








4851 


B 


622381 


1662 


p 


229930 






99172 7 


4852 


B 


621404 


1663 


F 


228032 






9940 0 7 


4853 


B 


623307 


1664 


F 


231129 








4854 


B 


621932 


1665 


F 


229205 








4855 


B 


623834 


1666 


p 


232785 








4856 


B 


622666 


1667 


p 


230915 








4857 


B 


624588 


1668 


F 


233561 




3263 


F 


994763 


4858 


B 


623689 


1669 


F 


231664 




3264 


F 


998586 


4859 


B 


625605 


1670 


F 


234013 




3265 


F 


996755 


4860 


B 


624435 


1671 


F 


232149 




3266 


F 


999153 




4861 


B 


626298 
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SEQ ID 




position 




SEQ ID 




position 


1672 


p 










997253 


1673 












1000967 


1674 












999092 


1675 












1001173 














999246 


1677 




236045 








1001604 


1678 




238482 








999645 


1679 












1004159 


1680 


p 


240094 








1002326 


1681 




238190 








1004763 


1682 












1002871 


1683 




239820 








1005160 


1684 












1003235 


1685 












1007181 


1686 












1005250 


1687 


p 


242360 








10 075 61 


1688 


p 










10 05665 


1689 


p 










10 0 8 8 55 


1690 












1007002 


1691 












1010205 


1692 




247498 








1008342 


1693 












1011716 


1694 


p 










1009823 


1695 


p 


246444 










1696 




249500 








1009914 


1697 












1012372 


1698 












1010385 


1699 


p 










1012567 


1700 












1010624 


1701 




248942 










1702 


F 


251847 




3297 


F 




1703 


F 


249939 




3298 


F 


1013690 


1704 


F 


254897 




3299 


F 


1011856 


1705 


F 252955 




3300 


F 


1014301 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


4862 


B 


625204 


4863 


B 


627128 


4864 


B 


626885 


4865 


B 


628790 


4866 


B 


627128 


4867 


B 


629026 


4868 


B 


628073 


4869 


B 


629983 


4870 


B 


628359 


4871 


B 


630267 


4872 


B 


628976 


4873 


B 


63085C 


4874 


B 


630023 


4875 


B 


631988 


4876 


B 


630642 


4877 


B 


632526 


4878 


B 


631205 


4879 


B 


633081 


4880 


B 


632046 


4881 


B 


633969 


4882 


B 


638311 


4883 


B 


640204 


4884 


B 


640448 


4885 


B 


642328 


4886 


B 


643695 


4887 


B 


645639 


4888 


B 


640847 


4889 


B 


642769 


4890 


B 


644745 


4891 


B 


646615 




B 


645686 


4893 


B 


647558 


4894 


B 


646060 


4895 


B 


647972 
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SEQ ID 


Or. 


position 




SEQ ID 


Or 


position 


1706 


p 


256543 










1707 


p 












1708 


F 


257692 








1013 010 


1709 


F 


255790 








10 15664 


1710 


F 


258561 








1013 82 0 


1711 


p 


256651 










1712 


p 










1015099 


1713 


p 


257036 










1714 


F 


261368 








1015786 


1715 


F 


259469 










1716 


p 


263887 








10164 60 


1717 


F 


262000 








1019602 


1718 


F 


264503 










1719 


p 


262599 








1019876 


1720 


F 


265364 










1721 


F 


263512 








102 0 853 


1722 


p 


266202 








1018 956 


1723 


F 


264277 








10218 7 8 


1724 


p 


266709 










1725 


F 


264801 








1VZ J Ub<± 


1726 


F 


267847 








1021186 


1727 


p 










1023415 


1728 


F 


267980 










1729 


F 


266077 








1023748 


1730 


F 


268271 










1731 


F 


266341 








1024485 


1732 


p 


269840 








1022574 


1733 


F 


267913 








1024744 


1734 


p 


270961 








1022 83 6 


1735 


F 










1025618 


1736 


F 


271883 




3331 


F 


1023720 


1737 


F 


270080 




3332 


F 


1026323 


1738 


F 


272642 




3333 


F 


1024403 


1739 


F 


270748 




3334 


F 


1027710 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


4896 


B 


647331 


4897 


B 


649231 


4898 


B 


649987 


4899 


B 


651829 


4900 


B 


650580 


4901 


B 


652484 


4902 


B 


651942 


4903 


B 


653852 


4904 


B 


652395 


4905 


B 


654310 


4906 


B 


653132 


4907 


B 


655028 


4908 


B 


653827 


4909 


B 


655713 


4910 


B 


662071 


4911 


B 


664023 


4912 


B 


662543 


4913 


B 


664403 


4914 


B 


663295 


4915 




665205 


4916 


B 


663972 


4917 


B 


665850 


4918 


B 


664432 


4919 


B 


666332 


4920 


B 


665860 


4921 


B 


667789 


4922 


B 


666312 


4923 


B 


668233 


4924 


B 


666652 


4925 


B 


668550 






668338 


4927 


B 


670238 


4928 


B 


668605 


4929 


B 


670495 
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SEQID 


Or. 










posi t ion 




SEQ ID 


Or. 


position 


1740 


p 


273367 








± U z, o ts u y 




4930 


B 


668690 


1741 




271477 








103 0272 


4931 




670590 


1742 


F 


274562 








1 r» 0 Q "5 ft Q 






B 


669766 


1743 


F 


272702 








1031486 






B 


671653 


1744 


F 


275882 








±uz y o u z 








670160 


1745 


F 


273984 


3340 












672109 


1746 


F 


278004 












B 


671000 


1747 


F 


276149 










B 


672900 


1748 


F 


278747 








4938 


B 


671470 


1749 


F 


276893 








4939 




673412 


1750 


p 


279521 






103 4 055 


4 94 0 


B 


672685 


1751 


F 


277632 






103 674 8 


4941 




674567 


1752 


F 


281076 






1034844 


4942 


B 


673461 


1753 


F 


279118 








4943 


B 


675365 


1754 


F 


281551 


3349 






4944 


B 


674786 


1755 


F 


279668 








4945 


B 


676682 


1756 


p 


282573 








4946 


B 


675456 


1757 


p 


280663 








4947 


B 


677375 


1758 


F 


284229 








4948 


B 


676683 


1759 


F 


282316 








4949 




678594 


1760 


F 


284598 


3355 










677334 


1761 


F 


282655 








4951 


B 


679183 


1762 


F 


285418 


3357 






4952 


B 


678726 


1763 


F 


283518 










B 


680596 


1764 


p 


286104 








4954 


B 


679729 


1765 


F 


284229 








4955 


B 


681628 


1766 


F 


286456 












680747 


1767 


F 


284531 












682668 


1768 


p 


287865 










4958 


B 


681500 


1769 


p 


286008 










4959 


B 


683406 


1770 


F 


289163 




3365 


F 


79840 


4960 


B 


682779 


1771 


F 


287384 




3366 


F 


86772 




4961 


B 


684716 


1772 


F 


290609 




3367 


F 


84880 




4962 


B 


683320 


1773 


F 


288709 




3368 


F 


109188 




4963 


B |S85249 
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SEQID 




position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 














107337 




4964 


B 


684716 






289389 












4965 


B 


686585 






292107 








109188 




4966 


B 


685010 


1777 












111505 




4967 


B 


686897 






293099 








109597 




4968 


B 


686423 


1779 












112432 




4969 


B 


688323 














110462 


4970 


B 


687426 






292883 






113446 


4971 


B 


689324 


1782 










111592 


4972 


B 


688619 


1783 










120225 


4973 


B 


690482 


1784 


p 


296018 






118303 


4974 


B 


688653 


1785 




294095 






124892 


4975 


B 


690563 


1786 










123004 


4976 


B 


689836 


1787 










131327 


4977 


B 


691775 


1788 


p 


7Q8ARA 






129485 


4978 


B 


690186 


1789 


p 










4979 


B 


692088 


1790 


p 


300305 








4980 


B 


690715 


1791 


p 


298407 








4981 


B 


692616 


1792 












4982 


B 


690937 


1793 




299946 






163715 


4983 


B 


692837 


1794 












4984 


B 


692091 


1795 


p 


302849 








4985 


B 


693991 


1796 












4986 


B 


694171 


1797 




303992 








4987 


B 


696078 


1798 










166302 


4988 


B 


695197 


1799 




304303 






170287 


4989 


B 


697093 


1800 










168387 


4990 


B 


697486 


1801 




304856 






176838 


4991 


B 


699428 


1802 










174996 


4992 


B 


698313 


1803 












4993 


B 


700238 


1804 


F 


309635 




3399 


F 


185900 








1805 


F 


307750 




3400 


F 


188083 




4995 


B 


700515 


1806 


F 


310491 




3401 


F 


186208 




4996 


B 


700337 


1807 


F 


308597 




3402 


F 


190117 




4997 


B 


702249 
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SEQID 


Or. 


position 








pos i t ion 




SEQ ID 


Or. 


position 


1808 


p 










188168 




4998 


B 


701115 


1809 


p 


309790 












4999 


B 


703015 


1810 


p 


313188 








194 946 






B 


702385 


1811 


p 


31 1292 




3406 












704285 


1812 


F 


314121 














703636 


1813 


p 


312194 












B 


705561 


1814 


p 


314489 










B 


705271 


1815 


p 










5005 


B 


707136 


1816 


F 


315431 








5006 




705875 


1817 


p 


313526 






24 9227 


5007 




707725 


1818 


F 


316309 


3413 






5008 




706444 


1819 


F 


314380 






252 939 






708279 


1820 


F 


317102 








5010 


B 


706741 


1821 


F 


315214 








5011 


B 


708673 


1822 


F 


317271 








5012 


B 


708324 


1823 


F 


315343 


3418 






5013 


B 


710226 


1824 


F 


317380 










B 


708673 


1825 


F 


315480 


342 0 




2 753 90 






710518 


1826 


p 


318256 


3421 










708876 


1827 


F 


316352 










B 


710791 


1828 


F 


319047 










B 


710498 


1829 


F 


317181 


3424 




2 82260 






712447 


1830 


F 


320325 












711435 


1831 


F 


318338 


3426 




2 92925 






713354 


1832 


F 


321228 










B 


712993 


1833 


F 


319366 










B 


714887 


1834 


F 


321676 












713686 


1835 


F 


319782 








5025 


B 


715574 


1836 


F 


322066 






3 06806 


5026 


B 


714474 


1837 


p 


320097 










5027 


B 


716354 


1838 


F 


322910 




3433 


F 


310073 




5028 


B 


714867 


1839 


F 


320982 




3434 


F 


312375 




5029 


B 


716760 


1840 


F 


324744 




3435 


F 


310483 




5030 


B 


716047 


1841 


F 322849 




3436 


F 


312531 




5031 


B 


717877 
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position 




SEQ ID 




position 




SEQ ID 


Or . 


position 


1842 


p 














5032 


B 


716086 


1843 
















5033 




717976 


1844 


p 










318 0 09 




5034 




717189 


















5035 


B 


719068 


1846 


p 


327038 
















718624 


1847 


p 


325162 










5037 


B 


720503 


1848 


p 












5038 


B 


719083 


1849 


p 


326079 








5039 




720983 


1850 


p 










5040 


B 


720047 


1851 


p 










5041 


B 


722004 


1852 




329032 






375085 


5042 


B 


720503 


1853 










378954 


5043 


B 


722393 


1854 


p 


329329 








5044 


B 


720753 


1855 


p 


327489 








5045 


B 


722653 


1856 


p 


330446 








5046 




721798 


1857 


p 


32855 1 






4fc u u y _L 3 




B 


723724 


1858 


p 










5048 


B 


722631 


1859 




329032 






4 0 9744 


5049 


B 


724493 


1860 


p 


331410 






4 07 904 


5050 


B 


723468 


1861 


p 










5051 


B 


725376 


1862 


p 


332534 






409253 






724852 


1863 


p 


330626 








5053 




726743 


1864 


p 


332782 








5054 


B 


726005 


1865 




330879 








5055 


B 


727903 


1866 


p 


333587 








5056 


B 


726779 


1867 


p 










5057 


B 


728691 


1868 


p 


333870 








5058 


B 


727058 


1869 


p 










5059 


B 


728947 


1870 


p 










5060 


B 


727727 


1871 












5061 


B 


729613 


1872 


F 


334958 


3467 


F 


440364 


5062 




72 8224 


1873 


F 


333049 


3468 


F 


443303 




5063 


B 


730116 


1874 


F 


334958 


3469 


F 


441380 




5064 


B 


729048 


1875 


F 


333049 




3470 


F 


442939 




5065 


B 


730907 
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SEQID 


Or. 






SEQ ID 








SEQ ID 




position 


1876 


F 


335655 








441047 








729566 


1877 


F 


333766 
















731468 


1878 


F 


336117 




3473 












732909 


1879 


F 


334219 


















1880 


F 


337108 
















734663 


1881 


F 


335210 




3476 














1882 


F 


34025 1 




3477 












735879 


1883 


F 


338372 
















737785 


1884 


F 


341538 


3479 










736724 


1885 


F 


339662 


3480 




496852 






738632 


1886 


F 


341953 


3481 




494952 






737474 


1887 


F 


339995 






499979 






739421 


1888 


F 


342348 


3483 




498074 






738007 


1889 


F 


340450 


3484 






5079 




739907 


1890 


F 


343112 






506798 






738911 


1891 


F 


341242 


3486 










740799 


1892 


F 


343736 


348 7 










739960 


1893 


F 


341811 


3488 


p 








741908 


1894 


F 


344117 


348 9 










742277 


1895 


F 


342207 












744187 


1896 


F 


344940 


3491 


F 










1897 


F 


343000 












744989 


1898 


F 


345837 


3493 


F 


554706 








1899 


F 


343958 


3494 










745539 


1900 


F 


346872 


3495 










744565 


1901 


F 


344994 


3496 










746432 


1902 


F 


347910 












744977 


1903 


F 


345971 














746867 


1904 


F 


350124 














745249 


1905 


F 


348298 






57 0844 








747138 


1906 


F 


351095 




3501 


F 


568915 




5096 


B 


745777 


1907 


F 


349167 




3502 


F 


575571 




5097 


B 


747677 


1908 


F 


351996 




3503 


F 


573671 




5098 


B 


746632 


1909 


F 


350122 




3504 


F 


590045 




5099 


B 


748532 
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SEQ ID 


Or. 










position 




SEQ ID 


Or. 


position 


1910 


p 


353051 
















747054 


191 1 




351186 














B 


748893 


1912 


p 


















748519 


1913 


p 


351481 
















750396 


1914 


p 


353908 
















1915 


F 


351996 
















1916 


p 


354723 










5106 


B 


749646 


1917 


p 


352799 






6 0 9842 


5107 


B 


751546 


1918 


p 


356466 






6 0 7958 


5108 


B 


749922 


1919 


p 


354569 












751824 


1920 


p 


357107 








5110 


B 


750260 


1921 


F 


355178 


3516 






5111 


B 


752151 


1922 


p 


357767 










B 


752527 


1923 


p 


355878 








5113 


B 


754427 


1924 


p 


360528 






D *± /OUU 


5114 


B 


753169 


1925 


F 


358628 










B 


755064 


1926 


F 


360877 












755004 


1927 


p 


358974 


3 522 










756843 


1928 


p 


361573 












757034 


1929 


p 


359692 


3524 




660587 






758991 


1930 


F 


362584 












758532 


1931 


F 


360681 










B 


760452 


1932 


F 


363835 












758911 


1933 


p 


361966 












760841 


1934 


F 


364960 












760015 


1935 


F 


363021 


353 0 










761913 


1936 


p 


365240 








5126 


B 


760463 


1937 


F 


363360 








5127 


B 


762363 


1938 


p 


367060 








5128 


B 


760782 


1939 












5129 


B 


762671 


1940 


F 


368383 


3535 


F 


756641 


5130 


B 


762053 


1941 


F 


366505 




3536 


F 


760010 


5131 


B 


763911 


1942 


F 


368862 




3537 


F 


758082 




5132 


B 


762363 


1943 


F 


366963 




3538 


F 


770670 




5133 


B 


764264 



WO 99/28475 



SEQID 


Or. 


position 




F 


370513 






368631 




F 


370974 


1947 


F 


369076 




F 


372891 






370980 


1950 


F 


373395 




F 


371495 






374005 






372033 




F 


374474 




F 


372572 






376509 


1957 










377630 


1959 




375708 






378384 






376507 




F 


378798 






376871 






379413 




F 


377501 






379890 




F 


377989 


1968 


F 


381241 




F 


379348 




F 


382485 






380579 




F 


383395 




F 


381536 


1974 


F 


383730 


1975 


F 


381782 


1976 


F 


384948 


1977 


F |383057 



250 



SEQ ID 


Or. 


position 


3539 




768751 


3540 




771896 


3541 


F 


769996 


3542 


F 


787857 


3543 


F 


785958 


3544 


F 


815714 


3545 


F 


813840 


3546 


F 


846380 


3547 


F 


844470 


3548 


F 


867576 


3549 


F 


865673 


3550 


F 


875167 


3 551 


F 


873254 


3552 


F 


876214 


3553 


F 


874314 


3554 


F 


884093 


3555 


F 


882162 


3556 


F 


891248 


3557 


F 


889348 


3558 


F 


900125 


3559 


F 


898298 


3560 


F 


902048 


3561 


F 


900125 


3562 


F 


907563 


3563 


F 


905656 


3564 


F 


912076 


3565 


F 


910133 


3566 


F 


935157 


3567 


F 


933211 


3568 


F 


946473 






944568 


3570 


F 


952562 


3571 


F 


950664 


3572 


F 


965649 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


5134 


B 


763203 


5135 


B 


765107 


5136 


B 


764690 


5137 


B 


766595 


5138 


B 


765107 


5139 


B 


766977 


5140 


B 


766327 


5141 


B 


768221 


5142 


B 


766932 


5143 


B 


768851 


5144 


B 


768314 


5145 


B 


770221 


5146 


B 


769045 


5147 


B 


770945 


5148 


B 


770315 


5149 


B 


772234 


5150 


B 


770705 


5151 


B 


772598 


5152 


B 


770882 


5153 


B 


772781 


5154 


B 


771156 


5155 


B 


773044 


5156 


B 


772234 


5157 


B 


774148 


5158 


B 


773611 


5159 


B 


775511 


5160 


B 


774513 


5161 


B 


776404 


5162 


B 


776333 


5163 


B 


778191 


5164 


B 


777926 


5165 


B 


779832 


5166 


B 


777455 


5167 


B 


779380 



WO 99/28475 



251 



PCT/1B98/01939 



SEQID 


Or. 


position 




SEQ ID 




position 




SEQ ID 




position 


1978 


F 


385474 




3573 












779476 


1979 


F 


383532 




3574 




968519 










1980 


F 


385908 




3575 


F 












1981 


F 


384008 




3576 


F 


970497 








783686 


1982 


F 


386643 




3577 


F 


968601 








/OZDD / 


1983 


F 


384750 




3578 


F 


971879 








784562 


1984 


F 


387099 




3579 




970043 








1985 


F 


385204 




3580 


F 


972888 






787658 


1986 


F 


387581 




3581 


F 


970962 






786222 


1987 


F 


385677 


3582 




998162 






788126 


1988 


F 


388009 


3583 










786803 


1989 


F 


386062 


3584 


F 


1003657 






788703 


1990 


F 


388927 


3585 




1001756 






787998 


1991 


F 


387033 


3586 




1009313 






789876 


1992 


F 


389726 


3587 


F 


1007413 






788279 


1993 


F 


387821 


3588 


F 


102 8 954 








1994 


F 


391295 


3589 


F 


102703 9 








1995 


F 


389365 


3590 


B 










1996 


F 


392171 


3591 


B 


2645 






790862 


1997 


F 


390291 


3592 


B 


3521 








1998 


F 


393930 


3593 


B 










1999 


F 


392014 


3594 


B 


5295 








2000 


F 


395085 


3595 


B 


7188 








2001 


F 


393185 


3596 


B 


6740 








2002 


F 


395827 


3597 


B 


8652 








2003 


F 


393940 


3598 


B 










2004 


F 


396274 


3599 


B 










2005 


F 


394423 


3600 


B 










2006 


F 


397156 


3601 










795667 


2007 


F 


395216 












797571 


2008 


F 


398641 


3603 


B 


11160 




5198 


B 


796515 


2009 


F 


396790 




3604 


B 


9689 




5199 


B 


798382 


2010 


F 


399550 




3605 


B 


11591 




5200 


B 


797235 


2011 


F 


397659 




3606 


B 


10679 




5201 


B 


799168 



WO 99/28475 



SEQID 




position 






399797 






397915 






401527 






399623 












399960 






403017 






401131 






403017 












Af\AQl f\ 






4U3U1U 


2024 




405728 


2025 




403836 






406837 






/in/ion 












408347 






41 1488 






4095 1 8 


2032 










410487 












411263 


2036 






2037 












2039 










414921 






413049 


2042 


p 




2043 


F 


414606 


2044 


F 


417445 


2045 


F 


415557 



252 



SEQ ID 




position 


















13376 






12136 






14038 






13565 






15450 






14667 






16535 






15254 






17183 






16189 
























16710 






18587 






18458 
























22438 






24335 






22892 


















24924 






23442 






25313 














3638 




24822 


3639 


3 


26708 


3640 


B 


26618 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


5202 


B 


797585 


5203 


B 


799553 


5204 


B 


798102 


5205 


B 


799976 


5206 


B 


798306 


5207 


B 


800214 


5208 


B 


798662 


5209 


B 


800546 


5210 


B 


799168 


5211 


B 


801074 


5212 


B 


800159 


5213 


B 


802054 


5214 


B 


801153 


5215 


B 


803031 


5216 


B 


802363 


5217 


B 


804257 


5218 


B 


802893 


5219 


B 


804802 


5220 


B 


803466 


5221 


B 


805366 


5222 


B 


804440 


5223 


B 


806374 


5224 


B 


805576 


5225 


B 


807523 


5226 


B 


806511 


5227 


B 


808410 


5228 


B 


808305 


5229 


B 


810207 


5230 


B 


809229 


5231 


B 


811164 




B 


810700 


5233 


B 


812581 


5234 


B 


811825 


5235 


B |813787 



WO 99/28475 



253 



PCT/IB98/01939 



SEQID 


Or. 










position 




SEQ ID 




position 


2046 


F 


417812 




3641 


B 












2047 


F 


415912 




3642 














2048 


F 


418381 




3643 




29859 










2049 


F 


416517 




3644 


B 












2050 


F 


419453 




3645 


B 












2051 


F 


417447 




3646 












815892 


2052 


F 


420026 




3647 












814571 


2053 


F 


418068 




3648 














2054 


F 


421270 




3649 


B 


32849 










2055 


F 


419392 




3650 


B 


31516 








2056 


F 


422262 


3651 




33401 






815101 


2057 


F 


420364 


3652 


B 








817025 


2058 


F 


423007 


3653 










815953 


2059 


F 


421107 


3654 










817887 


2060 


F 


424834 


3655 


B 


35496 






817709 


2061 


F 


423018 


3S56 


B 








819618 


2062 


F 


426026 


3657 


B 


36052 






819559 


2063 


F 


424110 


3658 


B 


3 5251 








2064 


F 


426883 


3659 


B 


37164 








2065 


F 


425040 


3660 










821846 


2066 


F 


429303 


3661 










820622 


2067 


F 


427438 


3662 




36152 








2068 


F 


429609 


3663 


B 










2069 


F 


427737 


3664 


B 


37216 








2070 


F 


430104 


3665 










821778 


2071 


F 


428191 


3666 










323666 


2072 


F 


430622 


3667 










822494 


2073 


F 


428814 


3668 


B 










824387 


2074 


F 


431439 


3669 












823155 


2075 


F 


429537 




3670 














2076 


F 


431968 




3671 


B 


42074 




5266 


B 


824589 


2077 


F 


430108 




3672 


B 


40985 




5267 


B 


826509 


2078 


F 


434401 




3673 


B 


42887 




5268 


B 


826122 


2079 


F 


432485 




3674 


B 


42284 




5269 


B 


828041 



WO 99/28475 



SEQID 




position 


2080 


F 


436050 


2081 


F 


434134 


2082 




436342 






434428 






437672 






435775 




F 


438051 


2087 


F 


436165 


2088 


F 


438767 


2089 


F 


436866 






439139 










- F 








437574 


2094 


F 


440823 




F 


438923 




F . 


441668 






439746 






444271 






442371 


2100 


F 


446233 


2101 


F 


444302 




F 


447687 




F 


445803 






450318 




F 


448399 




F 


450876 




F 


449025 


2108 


F 


451274 






449397 


2110 


p 




2111 


F 


450513 


2112 


F 


453303 


2113 


F 


451427 



254 



SEQ ID 


Or. 


position 


3675 




44180 


3676 


B 


43337 


3677 


B 


45275 


3678 




44022 


3679 


B 


45927 


3680 


B 


45451 


3681 




47375 


3682 


B 


46308 


3683 


B 


4820B 


3684 


B 


46560 


3685 


B 


48485 


3686 


B 


47115 


3687 


B 


49053 


3688 


B 


48208 


3689 


B 


50118 




B 


48806 




B 


50708 


3692 


B 


50333 


3693 


B 


52220 


3694 




50960 


3695 




52890 


3696 


B 


52660 


3697 


B 


54606 


3698 


B 


53737 


3699 


B 


55645 




B 


54793 


3701 


B 


56691 


3702 


B 


55329 


3703 


B 


57226 


3704 


B 


56691 








3706 


B 


56317 


3707 


B 


58226 


3708 


B 


58469 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


5270 


B 


826572 


5271 


B 


828500 


5272 


B 


827246 


5273 


B 


829208 


5274 


B 


827489 


5275 


B 


829407 


5276 


B 


828274 


5277 


B 


830176 


5278 


B 


829495 


5279 


B 


831395 


5280 


B 


831192 


5281 


B 


833082 


5282 


B 


832005 


5283 


B 


833870 


5284 


B 


832373 


5285 


B 


834280 


5286 


B 


832964 


5287 


B 


834864 


5288 


B 


833672 


5289 


B 


835604 


5290 


B 


834022 


5291 


B 


835945 


5292 


B 


834907 


5293 


B 


836803 


5294 


B 


835683 


5295 


B 


837593 


5296 


B 


836018 


5297 


B 


837902 


5298 


B 


836714 


5299 


B 


838602 


5300 


B 


837484 


5301 


B 


839424 


5302 


B 


838950 


5303 


B 


840832 



WO 99/28475 



255 



PCT/IB98/01939 







position 




SEQ ID 




position 




SEQ ID 


Or. 


position 


21 14 
















5304 


B 


839800 


2115 
















5305 




841700 


2116 


p 














5306 




840808 


















5307 


B 


842715 


2118 




455808 












5308 


B 


841975 


2119 




453928 










5309 


B 


843834 


2120 


p 


45744O 










5310 




842679 


2121 


p 


455488 








5311 




844614 


2122 


p 










5312 


B 


843342 


2123 


p 


456354 








5313 


B 


845247 


2124 


p 










5314 


B 


843736 


2125 












5315 


B 


845626 


2126 












5316 


B 


846423 


2127 


p 


459060 








5317 


B 


848330 


2128 


p 


461659 








5318 


B 


844423 


2129 


p 


459758 








5319 


B 


846258 


2130 


p 


462674 








5320 


B 


848265 


2131 


p 










5321 


B 


850174 


2132 


p 


463788 








5322 


B 


850343 


2133 


p 


461895 








5323 


B 


852246 


2134 


p 


464479 










B 


851875 


2135 


p 


462602 










B 


853765 


2136 


p 


465882 










B 


853944 


2137 


p 


463989 








5327 


B 


855822 


2138 


p 


467200 








5328 


B 


855056 


2139 


p 


465300 








5329 


B 


856964 


2140 












5330 


B 


855750 


2141 


p 


466787 








5331 


B 


857714 


2142 


p 










5332 


B 


856488 


2143 












5333 


B 


858370 


2144 


F 


469572 


3739 


B 


73134 


5334 






2145 


F 


467707 


3740 


B 


73458 


5335 


B 


859303 


2146 


F 


470887 




3741 


B 


75376 


5336 


B 


859659 


2147 


F 


468984 




3742 


B 


75121 




5337 


B 


861570 



WO 99/28475 



256 



PCT/IB98/01939 



SEQ ID 


Or. 


position 




SEQ ID 




position 




SEQ ID 


Or. 


position 


2148 


p 


471590 












5338 


B 


860166 


2149 


p 


469835 














B 


8 62082 


2150 


p 


473033 




3745 












860879 


2151 


p 


471 133 
















862767 


2152 


p 


473761 














B 


861861 


2153 


p 


471861 
















863727 


2154 


p 














5344 


B 


862284 


2155 


p 


472478 












5345 


B 


864173 


2156 


F 


475333 










5346 


B 


863792 


2157 


p 


473465 










B 


865660 


2158 


F 


476279 










B 


864164 


2159 


F 


474390 








5349 


B 


866050 


2160 


F 


478446 








5350 


B 


864925 


2161 


F 


476546 








5351 


B 


866860 


2162 


F 


478869 








5352 


B 


866468 


2163 


p 


476918 










B 


868368 


2164 


F 


479441 


3759 










866860 


2165 


F 


477548 


3760 










868734 


2166 


F 


479676 


3761 






5356 


B 


867952 


2167 


F 


477775 










B 


869878 


2168 


p 


481277 










B 


869311 


2169 


p 


479377 


3764 








B 


871226 


2170 


F 


481635 












869582 


2171 


F 


479745 


3766 










871517 


2172 


F 


483172 






90884 




B 


871614 


2173 


F 


481279 










B 


873536 


2174 


F 


484659 












872804 


2175 


p 


482764 








5365 


B 


874702 


2176 


F 


485003 








5366 


B 


873093 


2177 


F 


483097 










B 


874980 


2178 


F 


487946 


3773 


B 


93567 


5368 


B 


874158 


2179 


F 


486083 




3774 


B 


92008 


5369 


B 


876056 


2180 


F 


487946 




3775 


B 


93902 




5370 


B 


874033 


2181 


F 


486093 




3776 


B 


92633 




5371 


B 


875932 



WO 99/28475 



257 



PCT/IB98/01939 



SEQ ID 


Or. 


position 




SEQ ID 




posd.ti.on 




SEQ ID 


Or . 


position 


2182 


p 


489220 












5372 


B 


874670 


2183 


p 


487274 














B 


876580 


2184 


p 














5374 


B 


875606 


2185 


p 


488388 












5375 




877506 






492138 










5376 


B 


876619 


2187 


p 


490229 














878519 


2188 


p 










5378 


B 


878160 


2189 


p 


490618 








5379 


B 


880034 


2190 


p 


493591 






i Art/no 


5380 




878429 


2191 












5381 


B 


880292 


2192 


p 


494297 






100853 


5382 


B 


879336 


2193 


p 


492436 








5383 


B 


881230 


2194 


p 


494530 






102387 


5384 


B 


879393 


2195 


p 


492679 








5385 


B 


881281 


2196 


p 


494637 








5386 


B 


880089 


2197 


p 










5387 


B 


881994 


2198 


p 


495467 








5388 


B 


881108 


2199 


p 










5389 


B 


883062 


2200 


p 


H70U / O 






104 666 


5390 


B 


882094 


2201 


p 


494126 






104 544 


5391 


B 


884013 


2202 


p 


497468 








5392 


B 


883124 


2203 


p 










5393 


B 


885027 


2204 


p 


498050 








5394 


B 


884014 


2205 












5395 


B 


885924 


2206 


p 


498994 








53 96 


B 


884530 


2207 


p 


497096 








5397 


B 


886432 


2208 


p 


500571 








5398 


B 


885226 


2209 


p 


498671 






10 918 9 


5399 


B 


887139 


2210 










111079 


5400 


B 


889010 


2211 










109684 


5401 


B 


890900 


2212 


F 


502469 


3807 


B 


111580 


5402 




390421 


2213 


F 


500572 


3808 


B 


110587 




5403 


B 


892310 


2214 


F 


503435 


3809 


B 


112486 




5404 


B 


891438 


2215 


F 


501547 


3810 


3 


112740 




5405 


B 


893287 



WO 99/28475 



258 



PCT/IB98/01939 



SEQ ID 


Or. 










position 




SEQ ID 


Or. 


position 


2216 


F 


505469 














891703 


2217 


p 


503545 
















893606 


2218 


F 


506768 
















893606 


2219 


F 


504880 
















895490 


2220 


F 


507356 














894049 


2221 


F 


505530 












B 


896024 


2222 


F 


508015 














894139 


2223 


F 


506157 


3818 










896074 


2224 


F 


508247 


3819 










894545 


2225 


F 


506351 










B 


896413 


2226 


F 


509270 


3821 








B 


894999 


2227 


F 


507356 










B 


896912 


2228 


F 


510759 


3823 










896127 


2229 


F 


508918 






124231 






898012 


2230 


F 


511268 


3825 


B 








897049 


2231 


F 


509359 


382G 










898949 


2232 


F 


512124 


3827 










901018 


2233 


F 


510202 


3828 










902955 


2234 


F 


512836 






127271 






902393 


2235 


F 


510926 


3830 


B 


492 






9043 01 


2236 


F 


514569 


3831 


B 








904098 


2237 


F 


512663 


3832 










906002 


2238 


F 


514688 


3833 




129050 






903951 


2239 


F 


512874 


3834 




128010 






905851 


2240 


F 


515334 






129905 


5430 




905825 


2241 


F 


513415 






12 994 7 




B 


907725 


2242 


F 


515516 


3837 










906700 


2243 


F 


513670 


3838 










908669 


2244 


F 


516230 


3839 










B 


907174 


2245 


F 


514348 












909066 


2246 


F 


517181 




3841 


B 


133749 




5436 


B 


907579 


2247 


F 


515261 




3842 




133381 




5437 


B 


909480 


2248 


F 


517840 




3843 


B 


135305 




5438 


B 


908962 


2249 


F 


515977 




3844 


3 


133804 




5439 


B 


910922 



WO 99/28475 



259 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


2250 


F 


518184 


2251 


F 


516283 


2252 


F 


5 1 8704 


2253 


F 


516804 


2254 


F 


519358 


2255 


F 


517404 


2256 


F 


520483 


2257 


F 


518598 


2258 


F 


521632 


2259 


F 


519765 


2260 


F 


521919 


2261 


F 


520024 


2262 


F 


523372 


2263 


F 


521495 


2264 


F 


524404 


2265 


F 


522508 


2266 


F 


525498 


2267 


F 


523600 


2268 


F 


525687 


2269 


F 


523753 


2270 


F 


526036 


2271 


F 


524164 


2272 


F 


526632 


2273 


F 


524732 


2274 


F 


528073 


2275 


F 


526158 


2276 


F 


528835 


2277 


F 


526999 


2278 


F 


529393 


2279 


F 


527504 


2280 


F 


530004 


2281 


F 


528082 


2282 


F 


530518 


2283 


F 


528594 



SEQ ID 


Or. 


position 


3845 


B 


135704 


3845 


B 


134219 


3847 


B 


136133 


3848 


B 


134729 


3849 


B 


136633 


3850 


B 


137694 


3851 


B 


139583 


3852 


B 


140496 


3853 


B 


142396 


3854 


B 


140953 


3855 


B 


142856 


3856 


B 


142031 


3857 


B 


143950 


3858 


B 


143520 


3859 


B 


145425 


3860 


B 


144066 


3861 


B 


145967 


3862 


B 


144629 


3863 


B 


146519 


3864 


B 


146547 


3865 


B 


148446 


3866 


B 


147304 


3867 


B 


149227 


3868 


B 


148296 


3869 


B 


150255 


3870 


B 


148769 


3871 


B 


150670 


3872 


B 


149320 


3873 


B 


151168 


3874 


B 


149532 






151470 


3876 


B 


149934 


3877 


B 


151845 


3878 


B 


151168 



SEQ ID 


Or. 


position 


5440 


B 


909415 


5441 


B 


911315 


5442 


B 


909804 


5443 


B 


911704 


5444 


B 


910218 


5445 


B 


912075 


5446 


B 


910834 


5447 


B 


912689 


5448 


B 


911225 


5449 


B 


913064 


5450 


B 


912492 


5451 


B 


914381 


5452 


B 


912852 


5453 


B 


914763 


5454 


B 


913438 


5455 


B 


915340 


5456 


B 


913901 


5457 


B 


915814 


5458 


B 


914730 


5459 


B 


916694 


5460 


B 


915356 


5461 


B 


917256 


5462 


B 


915717 


5463 


B 


917608 


5464 


B 


916854 


5465 


B 


918761 


5466 


B 


916762 


5467 


B 


918712 


5468 


B 


917848 


5469 


B 


919756 






918588 


5471 


B 


920424 


5472 


B 


918875 


5473 


B 


920796 



WO 99/28475 



260 



PCT/IB98/01939 



SEQID 


Or. 


position 




SEQ ID 


Or. 










position 


2284 


F 


532100 




3879 


B 


153047 










2285 


F 


530224 




3880 


B 












2286 


F 


533025 


















2287 


F 


531125 




3882 


B 


152887 








92194 9 


2288 


F 


533710 




3883 




154804 










2289 


F 


531825 




3884 












92 2328 


2290 


F 


534658 




3885 


B 


155482 








2291 


F 


532758 




3886 


B 








922978 


2292 


F 


535737 


3887 


B 


156157 








2293 


F 


533828 


3888 












2294 


F 


539456 


3 889 


B 








921740 


2295 


F 


537568 


3890 


B 


156823 








2296 


F 


540290 


3891 


B 


158731 








2297 


F 


538375 


3892 


B 


157745 








2298 


F 


540672 


3893 


B 










2299 


F 


538777 


3 894 


B 


159207 








2300 


F 


541573 


3895 


B 










2301 


F 


539706 


3896 


B 


160216 








2302 


F 


542999 


3897 


B 


162127 








2303 


F 


541102 


3898 


B 


161451 








2304 


F 


543922 


3899 


B 


163350 








2305 


F 


542057 


3900 


B 










2306 


F 


544268 


3901 


B 


163726 








2307 


F 


542354 


3902 


B 


162601 








2308 


F 


544691 


3903 


B 


164501 








2309 


F 


542785 


3904 


B 


163282 








2310 


F 


546323 


3905 


B 


165205 








2311 


F 


544441 


3906 


B 










2312 


F 


546467 


3907 


B 


166082 








2313 


F 


544650 












927489 


2314 


F 


547257 


3909 


B 


167672 


5504 


B 


926980 


2315 


F 


545357 


3910 


B 


166939 


5505 


B 


928903 


2316 


F 


547658 




3911 


B 


168861 


5506 


B 


927408 


2317 


F 


545781 




3912 


B 


168421 


5507 


B 


929331 



WO 99/28475 



261 



PCT/IB98/01939 



SEQID 


Or. 










position 




SEQ ID 


Or. 


position 


2318 


F 


548406 








170322 








927824 


2319 


F 


546466 




3914 














2320 


F 


549443 




3915 












928418 


2321 


F 


547527 




3916 














2322 


F 


550193 
















928979 


2323 


F 


548288 
















930879 


2324 


F 


551554 


3919 












930474 


2325 


F 


549665 




392 0 














2326 


F 


552877 


3921 










930879 


2327 


F 


550963 


3922 








B 


932776 


2328 


F 


554708 


3923 








B 


931298 


2329 


F 


552807 


3924 








B 


933223 


2330 


F 


556293 








5520 


B 


931886 


2331 


F 


554381 


3926 






5521 


B 


933812 


2332 


F 


557095 


3927 










932187 


2333 


F 


555164 


3928 










934117 


2334 


F 


557522 


3929 










932803 


2335 


F 


555656 


3930 








B 


934733 


2336 


F 


558021 








5526 


B 


933421 


2337 


F 


556191 


3 932 










935352 


2338 


F 


558964 


3 933 




18023 9 








2339 


F 


557076 


3934 


3 


180274 








2340 


F 


560913 


3935 










934052 


2341 


F 


559007 


3936 




180972 






935933 


2342 


F 


562832 










B 


934794 


2343 


F 


561002 










B 


936682 


2344 


F 


563374 


3939 










936606 


2345 


F 


561474 




3940 










938521 


2346 


F 


565061 












5536 


B 


938373 


2347 


F 


563198 
















940324 


2348 


F 


566840 




3943 


B 


185778 




5538 


B 


939321 


2349 


F 


564908 




3944 


B 


185007 




5539 


B 


941229 


2350 


F 


567802 




3945 


B 


186910 




5540 


B 


941153 


2351 


F 


565896 




3946 


B 


186246 




5541 


B 


943053 



WO 99/28475 



SEQID 




position 












569137 


2354 




572300 


2355 






2356 






2357 












2359 






2360 






2361 






2362 


p 


573964 


2363 


p 




2364 






2365 






2366 


p 


574704 


2367 


p 


! ! 


2368 


p 




2369 






2370 


p 




2371 


p 




2372 


p 




2373 






2374 


p 




2375 






2376 


p 




2377 




576870 


2378 






2379 




JoUU / J 


2380 






2381 






2382 


F 


582962 


2383 


F 


581060 


2384 


F 


583210 


2385 


F 


581316 | 



PCT/IB98/01939 



SEQ ID 




position 




SEQ ID 


Or . 


position 






188160 




5542 


B 


942291 






187080 




5543 


B 


944212 










5544 


B 


944964 










5545 




946809 










5546 




945527 










5547 


B 


947426 






189929 




5548 


B 


946546 










5549 


B 


948430 










5550 


B 


946896 










5551 


B 


948823 










5552 


B 


948677 










5553 




950581 










5554 


B 


949505 






192027 




5555 


B 


951398 










5556 


B 


949834 










5557 


B 


951743 










5558 


B 


950897 












B 


952796 










5560 


B 


951550 










5561 


B 


953534 










5562 


B 


951870 










5563 


B 


953763 












B 


953037 












B 


954930 










5566 


B 


954509 










5567 


B 


956384 










5568 


B 


955397 






198443 




5569 


B 


957278 






2 0 0328 




5570 


B 


958191 










5571 


B 


960060 


3977 


B 


201084 








959570 


3978 


B 


200198 




5573 


B 


961432 


3979 


B 


202084 




5574 


B 


961273 


3980 


B 


200547 




5575 


B 


963183 



- WO 99/28475 



SEQID 




position 






583564 






581697 






584048 






582179 












582897 






584978 






583031 






586256 






584320 






587527 


2397 










588892 


2399 






2400 










587452 






590516 


^2403 




588616 








2405 
















590708 






59290 1 






590973 


2410 






2411 




591990 


2412 










593484 






595935 








2416 


p 


597025 


2417 


F 


595158 


2418 


F 


598254 


2419 


F 


596402 



263 



SEQ ID 




position 


3981 




202447 






200934 






202806 






201224 






203115 






202606 






204583 






203735 






205603 


3990 


B 


204218 






206147 






2 06686 






208569 






207672 






209558 






208799 






210648 






209701 






211599 






210500 












211064 






212957 






211557 






213453 






213248 






215101 






214372 


4009 


B 


216286 






21493 1 






2168 8 7 


4012 


B 


215375 


4013 


B 


217268 


4014 


3 


216928 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


5576 


B 


961837 


5577 


B 


963749 


5578 


B 


962823 


5579 


B 


964695 


5580 


B 


962199 


5581 


B 


964099 


5582 


B 


965286 


5583 


B 


967118 


5584 


B 


965928 


5585 


B 


967828 


5586 


B 


967261 


5587 


B 


969179 


5588 


B 


968345 


5589 


B 


970181 


5590 


B 


969123 


5591 


B 


971023 


5592 


B 


970973 


5593 


B 


972849 


5594 


B 


972081 


5595 


B 


973933 


5596 


B 


973279 


5597 


B 


975173 


5598 


B 


974803 


5599 


B 


976683 


5600 


B 


975459 


5601 


B 


977346 


5602 


B 


976431 


5603 


B 


978335 


5604 


B 


976740 


5605 


B 


978640 




B 


977175 


5607 


B 


979042 


5608 


B 


977855 


5609 


B 


979768 



WO 99/28475 



264 



PCT/IB98/01939 



SEQID 


Or. 


position 




SEQ ID 


Or . 






SEQ ID 




posxtion 


2420 


F 


600515 




4015 


B 


218828 










2421 


F 


598575 




4016 












980060 


2422 


F 


601691 




4017 


B 


220037 










2423 


F 


599745 




4018 












2424 


F 


603291 


4019 




219561 








2425 


F 


601376 


4020 


B 








981104 


2426 


F 


603538 


4021 


B 


220822 








2427 


F 


601638 


4022 


B 


219229 








2428 


F 


604263 


4023 


B 










2429 


F 


602388 


4024 


B 


220005 








2430 


F 


605359 


4025 




221868 






980363 


2431 


F 


603425 


4026 


g 








9822 8 9 


2432 


F 


606622 


4027 


B 


222560 








2433 


F 


604721 


4028 


B 


221294 








2434 


F 


607722 


4029 


B 


223195 






983 818 


2435 


F 


605908 


4030 


B 


221762 






985718 


2436 


F 


609039 


4031 


B 


223673 








2437 


F 


607143 


4032 


B 


222338 






986608 


2438 


F 


609842 






224212 








2439 


F 


607958 


4034 


B 










2440 


F 


611565 


4035 


B 


225417 








2441 


F 


609657 


4036 


B 


224094 






j o O A Z 3 


2442 


F 


612719 


4037 


B 


226033 






986 92 5 


2443 


F 


610819 


4038 


B 








988825 


2444 


F 


613473 


4039 




227291 








2445 


F 


611565 


4040 


B 










2446 


F 


614096 


4041 










988354 


2447 


F 


612172 


4042 




225811 








2448 


F 


615072 


4043 


B 










2449 


F 


613207 












9903 93 


2450 


F 


615645 


4045 


B 


228871 


5640 


B 


989437 


2451 


F 


613745 


4046 


B 


227523 


5641 


B 


991305 


2452 


F 


617266 




4047 


B 


229432 




5642 


B 


993662 


2453 


F 


615347 




4048 


B 


227907 




5643 


B 


995596 



WO 99/28475 



SEQ ID 




position 






618142 




F 


616242 






619019 






617128 








2459 










620 1 82 








2462 




620929 






619019 






621446 


2465 






2466 










620292 






623244 


2469 




62 1 446 








2471 






2472 






2473 




OZJZWo 


2474 




626291 


75 




624369 








2477 






2478 










625703 


2480 




627927 






626025 






628712 








2484 


p 




2485 


F 


627685 


2486 


F 


630055 


2487 


F 


628178 



PCT/IB98/01939 



SEQ ID 




position 




SEQ ID 


Or. 


position 






229784 




5644 


B 


994015 






228 692 




5645 


B 


995906 










5646 


B 


994084 














995955 














994523 










5649 




996382 






233194 




5650 


B 


995903 






2 3322 6 




5651 


B 


997791 










5652 


B 


996831 










5653 


B 


998764 










5654 


B 


998505 










5655 


B 


1000417 










5656 


B 


999445 






235094 




5657 


B 


1001345 






236993 




5658 


B 


1001253 










5659 


B 


1003169 










5660 


B 


1001751 






23 844 0 




5661 




1003637 










5662 




1001954 






23 8 989 




5663 




1003786 










5664 




1004003 






2402 94 




5665 


B 


1005872 












B 


1005114 






242260 




5667 


B 


1006999 










5668 


B 


1005620 






243066 




5669 


B 


1007561 






245029 




5670 


B 


1007761 






244703 




5671 


B 


1009618 






246603 




5672 


B 


1008052 










5673 


B 


1009941 


4079 


B 


248017 








1008954 


4080 


B 


247104 




5675 


B 


1010818 


4081 


B 


248997 




5676 


B 


1009679 


4082 


B 


248001 




5677 


B 


1011621 



WO 99/2847S 



SEQID 




position 








2489 




628789 








2491 






2492 




637807 


2493 


p 




2494 




639960 






638102 


2496 


p 










2498 






2499 


p 




2500 




&AA\ Af, 


2501 






2502 


p 










2504 






2505 






2506 




646869 


2507 




644983 


2508 




649714 


2509 




OH /oUv 


2510 




A^n i oq 


251 1 




548260 


2512 






2513 






2514 




)52285 


2515 


p 




2516 










350653 


2518 


F 


659653 


2519 


F 


657741 


2520 


F 


661449 


2521 


F 


659621 



266 



SEQ ID 




position 






249872 






248835 


4085 




250713 






249697 






251574 






250643 






252560 






251439 






253306 






252401 






254231 






253312 






255224 






256969 






258840 






258065 






259964 






258945 






260826 






259428 






261313 






261845 






263746 






264350 






266263 






264963 






266887 






265848 


4111 


B 


267749 






266644 








4114 


B 


268188 


4115 


B 


270091 


4116 


B 


268188 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


5678 


B 


1012274 


5679 


B 


1014193 


5680 


B 


1012682 


5681 


B 


1014550 


5682 


B 


1012855 


5683 


B 


1014755 


5684 


B 


1013104 


5685 


B 


1014996 


5686 


B 


1013698 


5687 


B 


1015571 


5688 


B 


1014289 


5689 


B 


1016181 


5690 


B 


1014730 


5691 


B 


1016653 


5692 


B 


1015459 


5693 


B 


1017377 


5694 


B 


1016272 


5695 


B 


1018122 


5696 


B 


1017377 


5697 


B 


1019285 


5698 


B 


1018043 


5699 


B 


1019888 


5700 


B 


1019146 


5701 


B 


1021064 


5702 


B 


1019421 


5703 


B 


1021323 


5704 


B 


1020440 


5705 


B 


1022319 


5706 


B 


1021269 


5707 


B 


1023161 






1021789 


5709 


B 


1023722 


5710 


B 


1022638 


5711 


B 


1024535 



WO 99/28475 
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PCT/IB98/01939 



SEQID 


Or. 










posxtn.cn 




SEQ ID 




positxon 


2522 


F 


662058 




4117 




70091 








1023 900 


2523 


F 


660167 




4118 


B 












2524 


F 


662726 




4119 


B 


270440 








1024169 


2525 


F 


660831 




4120 


B 


268785 








1026083 


2526 


F 


663526 
















1024996 


2527 


F 


661591 




4122 




270439 








102 693 8 


2528 


F 


664035 




4123 


B 


272288 










2529 


F 


662134 




4124 


B 










1027165 


2530 


F 


665504 




4125 


B 


273 043 






1026136 


2531 


F 


663591 


4126 


B 








1028031 


2532 


F 


665530 


4127 


B 


273738 






1026823 


2533 


F 


663663 


4128 


B 








1028699 


2534 


F 


666606 


4129 










1027642 


2535 


F 


664705 


4130 




273817 






1029524 


2536 


F 


667901 


4131 


B 








1030824 


2537 


F 


665987 


4132 


B 


275062 








2538 


F 


667953 


4133 


B 


277005 








2539 


F 


666037 


4134 


B 


275886 








2540 


F 


668397 


4135 




2 7 7 8 5 7 






1032306 


2541 


F 


666496 


4136 


3 


277923 






1034209 


2542 


F 


669235 


4137 


B 








1037275 


2543 


F 


667366 


4138 


B 


279043 








2544 


F 


669763 


4139 


B 


281004 






103 613 0 


2545 


F 


667879 


4140 


B 


280126 






103 8037 


2546 


F 


670513 


4141 




282004 






1036727 


2547 


F 


668612 


4142 




2 81004 








2548 


F 


670963 


4143 


B 










2549 


F 


669058 


4144 


B 


281825 








2550 


F 


672473 


4145 












17209 


2551 


F 


670573 














19109 


2552 


F 


672985 




4147 


B 


284236 




5742 


B 


32032 


2553 


F 


671072 




4148 


B 


284583 




5743 


B 


33899 


2554 


F 


674381 




4149 


B 


286455 




5744 


B 


57057 


2555 


F 


672493 




4150 


B 


285055 




5745 


B |58954 
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SEQ ID 


Or. 






SEQ ID 




position 


2556 




675029 










2557 


p 


673135 










2558 


p 












2559 


p 


674357 










2560 


p 


677481 










2561 


p 


675559 










2562 


p 


678423 








288811 


2563 


p 










288145 


2564 


p 


679049 










2565 


p 










289448 


2566 


p 


680360 










2567 


p 


678423 










2568 


p 










292805 


2569 


p 


679420 








291814 


2570 


p 


682281 








293655 


2571 


p 


680435 










2572 


p 


682870 










2573 


p 


681012 










2574 


p 










295183 


2575 


p 


682281 










2576 


p 


684582 










2577 


p 


682664 










2578 


F 


685978 










2579 


p 


684033 










2580 


F 


687121 










2581 


F 


685186 










2582 


p 










299413 


2583 


F 


686044 








298970 


2584 


p 


588 169 








300855 


2585 


p 












2586 


F 


689393 




4181 


B 


302595 


2587 


F 


687511 




4182 


B 


302372 


2588 


F 


689580 




4183 


B 


304272 


2589 


F 


687727 




4184 


B 


305137 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


5746 


B 


57141 


5747 


B 


59033 


5748 


B 


65406 


5749 


B 


67210 


5750 


B 


73871 


5751 


B 


75741 


5752 


B 


78956 


5753 


B 


80903 


5754 


B 


80042 


5755 


B 


81944 


5756 


B 


82642 


5757 


B 


84491 


5758 


B 


87820 


5759 


B 


89658 


5760 


B 


110184 


5761 


B 


112086 


5762 


B 


111873 


5763 


B 


113837 


5764 


B 


112302 


5765 


B 


114206 


5766 


B 


113165 


5767 


B 


115093 


5768 


B 


114270 


5769 


B 


116158 


5770 


B 


121039 


5771 


B 


122904 


5772 


B 


125742 


5773 


B 


127643 


5774 


B 


132170 


5775 


B 


134028 






144647 


5777 


B 


146547 


5778 


B 


150960 


5779 


B 


152837 
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PCT/IB98/01939 



SEQ ID 


Or. 


position 












SEQ ID 




position 


2590 


F 


690204 








3 07039 








164761 


2591 


F 


688342 
















166686 


2592 


F 


69043 1 




4187 












166362 


2593 


F 


688497 


















2594 


F 


691790 




4189 














2595 


F 


689919 




4190 














2596 


F 


693614 




4191 




309120 








171056 


2597 


F 


691704 




4192 




3 09018 






173021 


2598 


F 


694723 


4193 










177747 


2599 


F 


692821 








5789 




179629 


2600 


F 


696922 












188605 


2601 


F 


695033 


4196 










190552 


2602 


F 


697714 


4197 




312899 






189016 


2603 


F 


695816 


4198 










190924 


2604 


F 


698510 


4199 










190871 


2605 


F 


696612 


4200 










192749 


2606 


F 


700037 


4201 










197533 


2607 


F 


698119 


4202 


B 


314590 






199449 


2608 


F 


700691 


4203 


B 










2609 


F 


698783 


4204 


B 


3 14977 








2610 


F 


701885 


4205 


B 


316880 






235455 


2611 


F 


699984 


4206 










237385 


2612 


F 


703303 












237448 


2613 


F 


701403 


4208 










239387 


2614 


F 


704791 


4209 










250266 


2615 


F 


702877 


4210 












2616 


F 


705452 


4211 










253731 


2617 


F 


703584 


4212 










255663 


2618 


F 


705918 








5808 


B 


255115 


2619 


F 


7040 1 9 












B 


256969 


2620 


F 


706241 




4215 


B 


320628 




5810 


B 


272158 


2621 


F 


704322 




4216 


B 


318094 




5811 


B 


274093 


2622 


F 


707833 




4217 


B 


320048 




5812 


B 


276317 


2623 


F 


705939 




4218 


B 


319182 




5813 


B 


278190 
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PCT/IB98/01939 



SEQID 


Or. 










position 




SEQ ID 




position 


2624 


F 


708029 


















2625 


F 


706086 








3 2 04 04 








280366 


2626 


F 


708653 








J I o 




5816 




283005 


2627 


F 








321720 








284873 


2628 


F 


7 1 0042 














293718 


2629 


F 


708142 














295643 


2630 


F 


71 1185 














303690 


2631 


F 


709291 






322582 


5821 




305624 


2632 


F 


712521 












309538 


2633 


F 


710641 










B 


311476 


2634 


F 


713432 








5824 


B 


312791 


2635 


F 


711506 








5825 


B 


314685 


2636 


F 


713901 








5826 


B 


313073 


2637 


F 


711987 


4232 








B 


314977 


2638 


F 


714557 












313506 


2639 


F 


712708 










B 


315343 


2640 


F 


715339 








5830 




320823 


2641 


F 


713437 












322730 


2642 


F 


715702 










B 


340723 


2643 


F 


713761 


4238 










342638 


2644 


F 


716892 






33 0270 






353562 


2645 


F 


714970 












355444 


2646 


F 


718240 










B 


373944 


2647 


F 


716381 










B 


375838 


2648 


F 


7 1 8240 










B 


377997 


2649 


F 


716380 










B 


379877 


2650 


F 


719563 








5840 


B 


379877 


2651 


F 


717658 






33 08 86 


5841 


B 


381778 


2652 


F 


719916 










B 


395318 


2653 


F 












B 


397228 


2654 


F 


720346 


4249 


B 


333375 


5844 


B 


401846 


2655 


F 


718429 


4250 


B 


331990 


5845 


B 


403782 


2656 


F 


721306 




4251 


B 


333884 




5846 


B 


410759 


2657 


F 


719440 




4252 


B 


332669 




5847 


B 


412677 
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SEQ ID 


Or. 


position 




SEQ ID 




position 


2658 


F 


722178 










2659 


p 












2660 


p 


1^9 








335166 


2661 


p 












2662 


p 


724357 










2663 


p 


72245 1 










2664 


p 


725491 










2665 


p 












2666 


p 


726312 










2667 


p 


724417 










2668 


F 


726526 










2669 


p 


724590 










2670 


F 


727245 










2671 


F 


725325 










2672 


p 


728081 










2673 


p 


726209 










2674 


F 


728510 










2675 


p 


726618 










2676 


p 


729214 










2677 


p 


727319 










2678 


p 


733006 








342110 


2679 


p 










342069 


2680 


F 


734566 










2681 


p 


732618 










2682 


p 


735410 










2683 


p 


733522 










2684 


F 










344593 


2685 


p 


734388 








343594 


2686 


F 


736969 










2687 


p 












2688 


F 


737678 




4283 


B 


346199 


2689 


F 


735820 




4284 


B 


344452 


2690 


F 


738618 




4285 


B 


346382 


2691 


F 


736719 




4286 


B 


345362 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


5848 


B 


411878 


5849 


B 


413779 


5850 


B 


415199 


5851 


B 


417099 


5852 


B 


423479 


5853 


B 


425332 


5854 


B 


428421 


5855 


B 


430332 


5856 


B 


429678 


5857 


B 


431571 


5858 


B 


443036 


5859 


B 


444947 


5860 


B 


444280 


5861 


B 


446161 


5862 


B 


443964 


5863 


B 


445811 


5864 


B 


446392 


5865 


B 


448276 


5866 


B 


468498 


5867 


B 


470382 


5868 


B 


472328 


5869 


B 


474285 


5870 


B 


488594 


5871 


B 


490459 


5872 


B 


497914 


5873 


B 


499837 


5874 


B 


500718 


5875 


B 


502596 


5876 


B 


509811 


5877 


B 


511702 








5879 


B 


513385 


5880 


B 


527090 


5881 


B 


529014 
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PCT/IB98/01939 



SEQID 


Or. 


position 




SEQ ID 


Or . 


position 




SEQ ID 




pos i t ion 


2692 


F 


739841 




4287 


B 












2693 


p 


737941 


















2694 


F 


741797 




4289 




348069 










2695 


F 


739895 




4290 














2696 


F 


742583 


4291 


B 


349228 








2697 


F 


740705 


4292 


B 


348165 






567099 


2698 


F 


743087 


4293 


B 


350060 








2699 


F 


741169 


4294 










569355 


2700 


F 


744211 


4295 


3 


352288 








2701 


F 


742277 


4296 












2702 


F 


744391 


4297 




353403 








2703 


F 


742492 


4298 


B 










2704 


F 


744739 


4299 


B 


354356 






576288 


2705 


F 


742813 


4300 


B 


352948 








2706 


F 


745342 


4301 




354901 






5 90890 


2707 


F 


743438 


4302 


B 


53959 








2708 


F 


746197 


4303 


B 










2709 


F 


744273 


43 04 


B 


354438 








2710 


F 


746725 


4305 


B 


356378 








2711 


F 


744784 


4306 


B 


354997 






60 9 012 


2712 


F 


748041 


4307 


B 


356866 






60 8260 


2713 


F 


746142 


43 08 












2714 


F 


748723 


4309 




358793 








2715 


F 


746795 


4310 


B 










2716 


F 


749171 


4311 


B 


359499 








2717 


F 


747310 


4312 


B 


358323 








2718 


F 


749475 


4313 


B 










2719 


F 


747568 


4314 


B 


360972 








2720 


F 


749761 


4315 










650757 


2721 


F 


747855 
















2722 


F 


752284 




4317 


B 


363263 


5912 


B 


652808 


2723 


F 


750384 




4318 


B 


362109 


5913 


B 


654682 


2724 


F 


753397 




4319 


B 


364008 




5914 


B 


655545 


2725 


F 


751497 




4320 


B 


362983 




5915 


B 


657446 
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SEQ ID 




position 




SEQ ID 




position 


2726 


p 










364867 


2727 


p 












2728 












3 66002 


2729 












365415 


2730 












367338 


2731 


p 


756276 








365807 


2732 


p 












2733 


p 










367607 


2734 


p 










369440 


2735 


p 










368881 


2736 












370788 


2737 












369317 


2738 


p 










371209 


2739 


p 


758798 








370522 


2740 


p 


761489 








372440 


2741 


p 


759589 










2742 


p 










373206 


2743 


p 










373097 


2744 


p 










374941 


2745 


p 


761215 








373753 


2746 


p 


764209 










2747 












374424 


2748 












376324 


2749 












374956 


2750 












376888 


2751 


p 










376611 


2752 












378511 


2753 


p 


/OHoVO 








377297 


2754 












379209 






766063 








378960 


2756 


F 


768572 




4351 




3 8 0 8 8 0 


2757 


F 


766671 




4352 


B 


379309 


2758 


F 


769873 




4353 


B 


381180 


2759 


F 


768006 




4354 


B 


379667 



PCT/IB98/01939 



SEQ ID 


Or. 


position 


5916 


B 


661392 


5917 


B 


663292 


5918 


B 


677837 


5919 


B 


679716 


5920 


B 


679748 


5921 


B 


681674 


5922 


B 


732909 


5923 


B 


734756 


5924 


B 


742639 


5925 


B 


744503 


5926 


B 


759613 


5927 


B 


761510 


5928 


B 


760782 


5929 


B 


762671 


5930 


B 


771617 


5931 


B 


773519 


5932 


B 


772628 


5933 


B 


774528 


5934 


B 


788703 


5935 


B 


790577 


5936 


B 


816591 


5937 


B 


818443 


5938 


B 


847145 


5939 


B 


849042 


5940 


B 


868276 


5941 


B 


870177 


5942 


B 


875887 


5943 


B 


877779 


5944 


B 


877137 


5945 


B 


879035 


5946 


B 


884780 


5947 


B 


886680 


5948 


B 


892172 


5949 


B 


894073 



WO 99/28475 



274 



PCT/IB98/01939 



SEQID 


Or. 






SEQ ID 




position 




SEQ ID 




position 


2760 


F 


769966 




4355 


B 












2761 


F 


768060 




4356 


B 


3 8 023 8 








902955 


2762 


F 


770411 




4357 


B 










SUZ / D u 


2763 


F 


768455 




4358 


B 


381699 








904687 


2764 


F 


771103 


4359 


B 


383615 








908266 


2765 


F 


769211 






B 










910218 


2766 


F 


771980 


















2767 


F 


770116 


















2768 


F 


773176 






3 


385837 








935988 


2769 


F 


771305 






B 












2770 


F 


773937 






B 


3 86065 










2771 


F 


771980 






B 


385479 










2772 


F 


776399 






B 


387365 








953426 


2773 


F 


774514 






B 










955397 


2774 


F 


776672 






B 


387635 










2775 


F 


774773 






B 


387115 










2776 


F 


777446 






B 


389019 








969548 


2777 


F 


775596 






B 


386903 










2778 


F 


779102 






B 












2779 


F 


777192 






B 












2780 


F 


781078 






B 


389504 










2781 


F 


779148 






B 












2782 


F 


782192 






B 


390055 










2783 


F 


780236 




4378 


B 


388524 










2784 


F 


785250 




4379 


B 












2785 


F 


783413 




4380 


B 


389428 








1000774 


2786 


F 


785324 




4381 












1004572 


2787 


F 


783427 




4382 












1006449 


2788 


F 


786392 




4383 












1010507 


2789 


F 


784488 




4384 


B 


391321 




5979 


B 


1012353 


2790 


F 


787401 




4385 


B 


393147 




5980 


B 


1029707 


2791 


F 


785488 




4386 


B 


392032 




5981 


B 


1031628 


2792 


F 


787693 




4387 


B 


393943 
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1. An isolated polynucleotide having a nucleotide sequence of a Chlamydia 
trachomatis genome, comprising 

(a) the nucleotide sequence of SEQ ID No. 1; 

(b) the nucleotide sequence contained within the Chlamydia trachomatis genomic 
DNA in ECACC Deposit No. 981 12618; 

(c) the nucleotide sequence contained in a clone insert in ECACC Deposit No. 
98112617; 

(d) a nucleotide sequence exhibiting at least 99.9% identity with the sequence of 
SEQ ID No. l;or 

(e) a nucleotide sequence exhibiting at least 80% homology to SEQ ID No. 1. 



2. An isolated polynucleotide which hybridizes to SEQ ID No. 1 or to the 
Chlamydia trachomatis genomic DNA contained in ECACC Deposit No. 98112618 or to a clone 
insert in ECACC Deposit No. 981 126 17 under conditions of high stringency . 

3. An isolated polynucleotide which hybridizes to SEQ ID No. 1 or to the 
Chlamydia trachomatis genomic DNA contained in ECACC Deposit No. 98112618 
under conditions of intermediate stringency. 

4. An isolated polynucleotide having a nucleotide sequence of an open reading 
frame (ORE) of a Chlamydia trachomatis genome, comprising: 

(a) a nucleotide sequence chosen from one of ORF2 to ORE 1 197; 

(b) a nucleotide sequence exhibiting at least 99.9% identity with one of ORF2 to 
ORE 1197; or 

(c) a nucleotide sequence exhibiting at least 80% homology to one of ORF2 to 
ORF1197. 



5. An isolated polynucleotide which hybridizes to one of ORF2 to ORF 1197 
under conditions of high stringency. 

6. An isolated polynucleotide which hybridizes to one of ORF2 to ORF 1197 
under conditions of intermediate stringency. 

7. The polynucleotide of Claim 2, 3, 4, 5, or 6 which encodes the following 
polypeptides or fragments thereof: 



WO 99/28475 PCT/IB98/01939 
281 

(a) a Chlamydia trachomatis transmembrane polypeptide having between 1 
and 3 transmembrane domains; 

(b) a Chlamydia trachomatis transmembrane polypeptide having between 4 and 6 
transmembrane domains; 

(c) a Chlamydia trachomatis transmembrane polypeptide having at least 7 
transmembrane domains; 

(d) a Chlamydia trachomatis polypeptide involved in intermediate metabolism of 
sugars and/or cofactors; 

(e) a Chlamydia trachomatis polypeptide involved in intermediate metabolism of 
nucleotides or nucleic acids; 

(f) a Chlamydia trachomatis polypeptide involved in metabolism of amino acids 
or polypeptides; 

(g) a Chlamydia trachomatis polypeptide having involved in metabolism of fatty 
acids; 

(h) a Chlamydia trachomatis polypeptide involved in the synthesis of the cell 
wall; 

(i) a Chlamydia trachomatis polypeptide involved in transcription, translation, 
and/or maturation process; 

(j) a Chlamydia trachomatis transport polypeptide; 

(k) a Chlamydia trachomatis polypeptide involved in the virulence process; 
(1) a Chlamydia trachomatis polypeptide involved in the secretory system and/or 
which is secreted; 

(m) a Chlamydia trachomatis polypeptide of the cellular envelope or outer cellular 

envelope of Chlamydia trachomatis. 
(n) a Chlamydia trachomatis surface exposed polypeptide; 
(o) a Chlamydia trachomatis lipoprotein; 

(p) a Chlamydia trachomatis polypeptide involved in lipopolysaccharide 
biosynthesis; 

(q) a Chlamydia trachomatis KDO-related polypeptide; 
(r) a Chlamydia trachomatis phosphomannomutase-related polypeptide; 
(s) a Chlamydia trachomatis phosphoglucomutase-related polypeptide; 
(t) a Chlamydia trachomatis lipid A component-related polypeptide; 
(u) a Chlamydia trachomatis polypeptide that contains an RGD sequence; 
(v) a Chlamydia trachomatis Type III secreted polypeptide; 
(w) a Chlamydia trachomatis cell wall anchored surface polypeptide; or 
(x) a Chlamydia trachomatis polypeptide that is not found in Chlamydia 
trachomatis. 
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8. A polynucleotide encoding a fusion protein, comprising one of ORP2 to ORF 
1 197 of Claim 4, 5, or 6 ligated in frame to a polynucleotide encoding a heterologous polypeptide. 

9. A recombinant vector that contains the polynucleotide of Claim 1, 2, 3, 4, 5 or 



10. A recombinant vector that contains the polynucleotide of Claim 8 . 

11. A recombinant vector that contains the polynucleotide of Claim 4, 5 or 6, 
operatively associated with a regulatory sequence that controls gene expression. 

12. A recombinant vector that contains the polynucleotide of Claim 8 operatively 
associated with a regulatory sequence that controls gene expression. 

13. A genetically engineered host cell that contains the polynucleotide of Claim 1 , 

2, 3, 4, 5 or 6. 



14. A genetically engineered host cell that contains the polynucleotide of Claim 8. 

15. A genetically engineered host cell that contains the polynucleotide of Claim 4, 
5 or 6 operatively associated with a regulatory sequence that controls gene expression in the host cell. 

16. A genetically engineered host cell that contains the polynucleotide of Claim 8 
operatively associated with a regulatory sequence that controls gene expression in the host cell. 

17. A method for producing a polypeptide, comprising: 

(a) culturing the genetically engineered host cell of Claim 15 under conditions 
suitable to produce the polypeptide encoded by the polynucleotide; and 

(b) recovering the polypeptide from the culture. 



18. A method for producing a fusion protein, comprising: 

(a) culturing the genetically engineered host cell of Claim 16 under conditions 
suitable to produce the fusion protein encoded by the polynucleotide; and 

(b) recovering the fusion protein from the culture. 



19. 



A polypeptide encoded by the polynucleotide of Claim 4, 5 or 6. 
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20. The polypeptide of Claim 19 which immunoreacts with seropositive serum of 
an individual infected with Chlamydia trachomatis. 

21. The polypeptide of Claim 19 which comprises the following polypeptides or 
fragments thereof: 

(a) a Chlamydia trachomatis transmembrane polypeptide having between 1 and 3 
transmembrane domains; 

(b) a Chlamydia trachomatis transmembrane polypeptide having between 4 and 6 
transmembrane domains; 

(c) a Chlamydia trachomatis transmembrane polypeptide having at least 7 
transmembrane domains; 

(d) a Chlamydia trachomatis polypeptide involved in intermediate metabolism of 
sugars and/or cofactors; 

(e) a Chlamydia trachomatis polypeptide involved in intermediate metabolism of 
nucleotides or nucleic acids; 

(f) a Chlamydia trachomatis polypeptide involved in metabolism of amino acids 
or polypeptides; 

(g) a Chlamydia trachomatis polypeptide involved in metabolism of fatty acids; 

(h) a Chlamydia trachomatis polypeptide involved in the synthesis of the cell 
wall; 

(i) a Chlamydia trachomatis polypeptide involved in transcription, translation, 
and/or maturation process; 

(j) a Chlamydia trachomatis transport polypeptide; 
(k) a Chlamydia trachomatis polypeptide involved in the virulence process; 
(1) a Chlamydia trachomatis polypeptide involved in the secretory system and/or 
which is secreted; 

(m) a Chlamydia trachomatis polypeptide of the cellular envelope or outer cellular 

envelope of Chlamydia trachomatis. 
(n) a Chlamydia trachomatis surface exposed polypeptide; 
(o) a Chlamydia trachomatis lipoprotein; 

(p) a Chlamydia trachomatis polypeptide involved in lipopolysaccharide 
biosynthesis; 

(q) a Chlamydia trachomatis KDO-related polypeptide; 
(r) a Chlamydia trachomatis phosphomannomutase-related polypeptide; 
(s) a Chlamydia trachomatis phosphoglucomutase-related polypeptide; 
(t) a Chlamydia trachomatis lipid A component-related polypeptide; 
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(u) a Chlamydia trachomatis polypeptide that contains an RGD sequence; 
(v) a Chlamydia trachomatis Type III secreted polypeptide; 
(w) a Chlamydia trachomatis cell wall anchored surface polypeptide; or 
(x) a Chlamydia trachomatis polypeptide that is not found in Chlamydia 
trachomatis. 



22. A fusion protein encoded by the polynucleotide of Claim 8. 



23. The fusion protein of Claim 22 which immunoreacts with seropositive serum 
of an individual infected with Chlamydia trachomatis. 



An antibody that immunospecifically binds to the polypeptide of Claim 19. 
An antibody that immunospecifically binds to the fusion protein of Claim 22. 



26. A method for the detection and/or identification of Chlamydia trachomatis in 
a biological sample, comprising: 

(a) contacting the sample with a polynucleotide primer of Claim 1, 2, 3, 4, 5, or 6 
in the presence of a polymerase enzyme and nucleotides under conditions 
which permit primer extension; and 

(b) detecting the presence of primer extension products in the sample in which the 
detection of primer extension products indicates the presence of Chlamydia 
trachomatis in the sample. 



27. A method for the detection and/or identification of Chlamydia trachomatis in 
a biological sample, comprising: 

(a) contacting the sample with a polynucleotide probe of Claim 1, 2, 3, 4, 5, or 6 
under conditions which permit hybridization of complementary base pairs; and 

(b) detecting the presence of hybridization complexes in the sample in which the 
detection of hybridization complexes indicates the presence of Chlamydia 
trachomatis in the sample. 



28. A method for the detection and/or identification of Chlamydia trachomatis in 
a biological sample, comprising: 

(a) contacting the sample with the antibody of Claim 24 under conditions suitable 
for the formation of immune complexes; and 
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(b) 



detecting the presence of immune complexes in the sample, in which the 
detection of immune complexes indicates the presence of Chlamydia 
trachomatis in the sample. 



29. 



A method for the detection and/or identification of antibodies to Chlamydia 



trachomatis in a biological sample, compnsing: 



(a) contacting the sample with a polypeptide of Claim 19 under conditions 
suitable for the formation of immune complexes; and 

(b) detecting the presence of immune complexes in the sample, in which the 
detection of immune complexes indicates the presence of Chlamydia 
trachomatis in the sample. 



30. A DNA chip containing an array of polynucleotides comprising at least one of 
the polynucleotides of Claim 1, 2, 3, 4, 5, or 6. 

31. A protein chip containing an array of polypeptides comprising at least one of 
the polypeptides of Claim 19. 

32. An immunogenic composition comprising the polypeptide of Claim 19 and a 
pharmaceutically acceptable carrier. 

33. An imrnunogeneic composition comprising the polypeptide of Claim 20 and a 
pharmaceutically acceptable carrier. 

34. An immunogenic composition comprising the fusion protein of Claim 22 and 
a pharmaceutically acceptable earner. 

35. An immunogenic composition comprising the fusion protein of Claim 23 and 
a pharmaceutically acceptable carrier. 

36. A pharmaceutical composition comprising the polypeptide of Claim 19 and a 
pharmaceutically acceptable carrier. 



37. A pharmaceutical composition comprising the polypeptide of Claim 20 and a 
pharmaceutically acceptable carrier. 



WO 99/28475 PCT/IB98/01939 
286 

38. A pharmaceutical composition comprising the polypeptide of 
Claim 22 and a pharmaceutical^ acceptable carrier. 

39. A pharmaceutical composition comprising the polypeptide of Claim 23 and a 
pharmaceutical^ acceptable carrier. 



40. A method of immunizing against Chlamydia trachomatis, comprising: 
administering to a host an immunizing amount of the immunogenic composition of Claim 32. 

41. A method of immunizing against Chlamydia trachomatis, comprising: 
administering to a host an immunizing amount of the immunogenic composition of Claim 33. 

42. A method of immunizing against Chlamydia trachomatis, comprising 
administering to a host an immunizing amount of the immunogenic composition of Claim 34. 

43. A method of immunizing against Chlamydia trachomatis, comprising: 
administering to a host an immunizing amount of the immunogenic composition of Claim 35. 

44. A DNA immunogenic composition comprising the expression vector of Claim 

11. 



45. The DNA composition of Claim 44, wherein the DNA composition directs the 
expression of a neutralizing epitope of Chlamydia trachomatis. 

46. A DNA immunogenic composition comprising the expression vector of Claim 

12. 



47. The DNA composition of Claim 46, wherein the DNA composition directs the 
expression of a neutralizing epitope of Chlamydia trachomatis. 



48. A screening assay, comprising: 

(a) contacting a test compound with an isolated polynucleotide of Claim 1,2,3, 4, 
5 or 6; and 

(b) detecting whether binding occurs. 



49. A screening assay, comprising: 

(a) contacting a test compound with the polypeptide of Claim 1 9; and 
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(b) detecting whether binding occurs. 

50. A screening assay, comprising: 

(a) contacting a test compound with the polypeptide of Claim 22; and 

(b) detecting whether binding occurs. 

51. A kit comprising a container containing an isolated polynucleotide of Claim 1 , 

2, 3, 4, 5 or 6. 

52. The kit of Claim 51 wherein the polynucleotide is a primer or a probe. 

53. The kit of Claim 5 1 wherein the polynucleotide is a primer and the kit further 
comprises a container containing a polymerase. 

54. The kit of Claim 51 which further comprises a container containing 
deoxynucleotide triphosphates. 

55. A kit comprising a container containing an antibody that immunospecifically 
binds to the polypeptide of Claim 19. 

56. A kit comprising a container containing an antibody that immunospecifically 
binds to the fusion protein of Claim 22. 
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